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POLYPEPTIDES OF G- COUPLED RECEPTOR PROTEINS, 
AND COMPOSITIONS AND METHODS THEREOF 

FIELD OF THE INVENTION 

The present invention relates to compounds, 
compositions and methods involving synthetic, isolated and/or 
recombinant G-protein coupled receptor polypeptides that 
comprise fragments and/or consensus peptides of G-protein 
coupled receptors . 

BACKGROUND OF THE INVENTION 

The membrane protein gene superfamily of G-protein 
coupled receptors (GPRs) has been characterized as having seven 
putative transmembrane domains. The domains are believed to 
represent transmembrane a- helices connected by extracellular or 
cytoplasmic loops. Of the 74 sequenced members of this 
G-protein receptor superfamily, the shortest sequence of 324 
amino acids represents the rat mas oncogene and the longest, of 
744 amino acids, represents the human thyroid- stimulating 
hormone (TSH) receptor. GPRs thus include a wide range of 
biologically active receptors, such as hormone-, viral-, growth 
factor- and neuroreceptors. 

G-protein coupled receptors have been characterized as 
including these seven conserved hydrophobic stretches of about 
20-30 amino acids, connecting at least 8 divergent hydrophilic 
loops. The G-protein family of coupled receptors includes 
dopamine receptors which bind in a noncovalent but high affinity 
manner to neuroleptic drugs used for treating psychotic and 
neurological disorders. For example, the dopamine D : receptor 
includes these transmembrane domains, two of which (TM III and 
TM V; see below) have been implicated by site- selective 
mutagenesis to demonstrate functional, association with D : 
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z. 

provide sequences which impart biological activity to GPRs . 
Most GPRs have single conserved cysteine residues in each of the 
first two extracellular loops which form disulfide bonds that 
are believed to stabilize functional protein structure. TM3 is 
5 also implicated in signal transduction. 

Phosphorylation and lipidation (palmitylation or 
f arnesylation) of cysteine residues can influence signal 
transduction of some GPRs . Most GPRs contain potential 
phosphorylation sites (e.g., serine or theronine residues) 
10 within the third cytoplasmic loop and/or the carboxy terminus. 

For several GPRs , such as the /3-adrenoreceptor , phosphorylation 
by protein kinase A and/or specific receptor kinases mediates 
receptor desensitization . 

Non- limiting examples of GPRs include cAMP receptors, 
15 adenosine receptors, ^-adrenergic receptors, muscarinic 

acetylcholine receptors, a-adrenergic receptors, serotonin 
receptors (5-HT), histamine H2 receptors, thrombin receptors, 
kinin receptors, follicle stimulating hormone receptors, opsins 
and rhodopsins, odorant receptors, cytomegalovirus receptor, 
20 etc. See e.g., Probst et al DNA and Cell Biology 11:1-20(1992), 

which is entirely incorporated herein by reference. 

The ligand binding sites of GPRs are believed to 
comprise a hydrophilic socket formed by several GPR 
transmembrane domains, which socket is surrounded by hydrophobic 
25 residues of the GPRs. The hydrophilic side of each GPR 

transmembrane helix is postulated to face inward and form the 
polar ligand binding site. TM3 has been implicated in several 
GPRs as having a ligand binding site, such as including the TM3 
aspartate residue. Additionally, TM5 serines, a TM6 asparagine 
3 0 and TM6 or TM7 phenylalanines or tyrosines are also implicated 

in ligand binding. 

GPRs can be intracellularly coupled by heterotrimeric 
G-proteins to various intracellular enzymes, ion channels and 
transporters. See, e.g., Johnson et al Endoc. Rev. 
35 10:317-331(1989) ; and Birnbaumer et al Biochem. Biophys . Acta 

1031:163-224(1990) which references are incorporated entirely 
herein by reference. GPR agonist binding catalyzes the exchange 
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of GTP for GDP on the a-subunit of the G-protein. Different 
G-protein a-subunits preferentially stimulate particular 
effectors to modulate various biological functions in a cell. 
Phosphorylation of cytoplasmic residues of GPRs has been 
5 identified as an important mechanism for the regulation of 

G-protein coupling of some GPRs. 

As a non - 1 imi t ing example of a GPR 1 igand , dopamine 
( 3 , 4 - dihydroxyphenethylamine ) is a critical neurotransmitter in 
the central nervous system (e.g., in the substantia nigra, 

10 midbrain, and hypothalamus) . Since the elucidation of the 

ascending mesolimbic and nigrostriatal pathways, these pathways 
have been found to be critical in the control of both motor 
initiation (nigrostriatal) behavior and affective (mesolimbic) 
behavior. The clinical efficacy of the major neuroleptic 

15 antipsychotic medications has been found to correlate with the 

respective affinities of these agents for the dopamine D 2 
receptor in the brain. A dopaminergic role in the 
symptomatology of the major psychoses has thus been 
hypothesized, although it is unclear if dopamine alone is 

20 etiological, (see, e.g., Davis et al . Am. J\ Psych. 148:1474- 

1476 (1991)) . Nonetheless, this hypothesis has served as a 
stimulus for current research in this area. 

One model for studying possible interactions of 
G-protein coupled receptors with their ligands has emerged from 

25 site-directed mutagenesis and biochemical analysis of the 

)3 - adrenergic receptor , as well as from biophysical analysis of 
the interaction of retinal with opsin. 

According to such a model, the binding of a GPR ligand 
to a G-protein coupled receptor involves multiple interactions 

3 0 between functional groups on the GPR ligand and residues within 

the hydrophophilic binding site of the receptor, 

While a number of the amino acid residues in the 
dopamine D : receptor have been postulated to participate in D : 
~ A cr a c 1 v j i n — ^ "H ct v-^.qp^ — -p ■=> ^ y 1 +- o b r r5 i "n ^ ci f 7- ott ^ i ^ ^ - H ^ v ^ r *- ^ p 1 



specifically dec ermine wnich residues are actually :::vc.ve: 
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binding in the D : system. Sibley et al . Soc. Neurosci. Abs . 
17:36.10, 324.5, 324.6 (1991). 

The clinical use of neuroleptics has provided a means 
for treating patients suffering from psychotic disorders. 
5 Short-term use of neuroleptics is indicated in several types of 

psychotic disorders, e.g., acute psychotic episodes, regardless 
of type; exacerbations of schizophrenia; acute manic excitement 
while deferring use of lithium or awaiting onset of its effects; 
adjunctive therapy for major depression with prominent psychotic 
symptoms, or when an antidepressant or ECT alone is not 
successful; for agitation in delirium, dementia, or severe 
mental retardation while seeking to identify and treat the 
primary basis of the problem; in certain chronic, degenerative, 
or idiopathic neuropsychiatric disorders with dyskinesias, such 
as Huntington's disease or Gilles de la Tourette's syndrome; or 
for ballism or hemiballism; childhood psychoses or apparently 
allied conditions marked by severe agitation or aggressive 
behavior; miscellaneous medical indications, notably nausea and 
vomiting, or intractable hiccups. 

Additionally, continuous long-term use of neuroleptics 
is indicated in many psychotic disorders, such ?.s (for more than 
six months) (i) primary indications such as Schizophrenia, 
Paranoia a,b , Childhood psychoses, some degenerative or idiopathic 
neuropsychiatric disorders (notably, Huntington's disease and 
Gilles de la Tourette's syndrome); (ii) secondary indications 
such as extremely unstable manic-depressive or other episodic 
psychoses (unusual), otherwise unmanageable behavior symptoms in 
dementia, amentia, or other brain syndromes; and (iii) 
questionable indications such as chronic characterological 
disorders with schizoid, "borderline, " or neurotic 
characteristics; substance abuse; or antisocial behavior, 
recurrent mood disorders. See, e.g., Baldessarini , Chemotherapy 
in Psychiatry, Revised and Enlarged Edition, Harvard University 
Press, Cambridge, MA, (1985) , the contents of which is entirely 
incorporated herein by reference. 

Neuroleptics are also referred to as neuroplegics, 
psychoplegics , psychoiept ics , antipsychotics and major 
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tranquilizers, buc are sometimes distinguished from non- 
neuroleptic anti -psychotics . Neuroleptics have recently been 
characterized as an agent that produces sedative or 
tranquilizing effects, and which also produces motor side 
5 effects, such as catalepsy or extrapyramidal symptomatology. 
Nonlimiting representative examples of neuroleptics include 
phenothiazine derivatives (e.g., chlorpromazine) ; thioxanthine 
derivatives (e.g., thiothixene); butyrophenone derivatives 
(e.g., haloperidol) ; dihydroindolone (e.g., molindone); 

10 dibenzoxazepine derivatives (e.g., loxapine) ; and "atypical" 
neuroleptics (e.g., sulpiride, remoxipiride pimozide and 
clozapine) . See Berstein Clinical Pharmacology Littleton, 
Mass.:PSG Publishing (1978); Usdin et al Clinical Pharmacology 
in Psychiatry New York: Elsevier North- Holland (1981) ; and 

15 Baldessarini, supra, (1985); and , which references are herein 
entirely incorporated by reference. 

The term "atypical neuroleptics" has been used to 
describe antipsychotic neuroleptics that produce few or no 
extrapyramidal side effects and which do not cause catalepsy in 

20 animals (See, e.g., Picket et al , Arch. Gen. Psychiatry 49 : 345 
(May 1992) . Alternatively, atypical neuroleptics, such as 
clozapine, have been described as those neuroleptics which have 
a higher affinity for D 4 and D, sites than for D 2 sites (See, 
e.g., Davis et al Amer. J. Psych. 148:1474, 1476 (November 

25 1991) . 

The long term use of all known anti-psychotics , such 
as neuroleptics or non- neuroleptic antipsychotics, has resulted 
in serious side effects, as present in Table I, such as 
persistent and poorly reversible motoric dysfunctions (e.g., 
30 tardive dyskinesia) in a significant number of patients. These 
side effects are especially prevalent in geriatric populations, 
and adequate pharmacological treatment of these debilitating 
motoric dysfunctions is not currently available. This problem 
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TABLE I 

Neurological Side Effects of 
Neuroleptic- Antipsychotic Drugs 



Reaction 



Features 



Period of Proposed mechanism Treatment 
maximum risk 



Acute dystonia 



Parkinsonism 



Akathisia 



Tardive 
dyskinesia 



"Rabbit" 
syndrome 



Malignant 
syndrome 



Spasm of musctes of 1 -5 days 

tongue, face, neck, 

back; may mimic 

seizures; not 

hysterica) 

Bradykinesia, rigidity, 
variable tremor, mask- 
facies, shuffling gait 

Motor restlessness; 
patient may experience 
anxiety or agitation 



Dopamine excess? 
Acetylcholine excess? 



Antiparkmsonism agents are 
diagnostic and curative 
(i.m. or i.v., then p.o.) 



5-30 days 
(rarely 
persists) 

5-60 days 
(commonly 
persists) 



6-24 months 
(worse on 
withdrawal) 



Oral-facial dyskinesia; 
choreo-etnetosis, some- 
times irreversible, 
rarely progressive 

Perioral tremor (late 
parkinsonism variant?); 
usually reversible 

Catatonia, stupor. Weeks 
fever, unstable pulse 
and blood pressure; 
myoglobinemia; can 
be fatal 



Months or 
years 



Dopamine blockade 



Unknown 



Dopamine excess? 



Unknown 



Unknown 



Antiparkmsonism agents 
(p.o); dopamine agonists 
risky? 

Reduce dose or change drug 
iow doses of propranolol;' 
antiparkinsontsm agents or 
or benzodiazepines may help 

Prevention best; treatment 
unsatisfactory; slow spontaneous 
remission 



Antiparkmsonism agents; reduce 
dose of neuroleptic 



Stop neuroleptic; antiparkmsonism 
agents usually fail; bromocriptine 
often helps; dantrolene variable; 
general supportive care crucial 



a. There may be an increased risk of hypotension on interacting high doses of propranolol with some antipsychotic 
agents; clonidine may also be effective at doses of 0.2-0.8 mg/day, but carries a high risk of hypotension (Zubenko et al., 
Psychiatry Res. 11:143, 1984). 

In addition, clozapine, although apparently capable of 
producing less motor side effects, can cause irreversible, 
potentially fatal agranulocytosis in a minority of patients 
administered the drug. Such serious side effects limit the use 
of 
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clozapine to patients who are resistant to treatment with other 
neuroleptics . 

Antipsychotics have a variety of significant 
pharmacological effects, e.g., as presented in the following 
Tables II and III. 

Table II 

Comparative Pharmacology of Neuroleptics 





Phenothiazine 


Th i oxanthene 


Butyrophenone 




Derivative 


Deri vat i ve 


Deri vat i ve 


Alkaloid 








Pharmacologic Actions 


Chlorpromazine 


Thiothixene 


Haloperidol 


Ant ipsychot i c 


Yes + * 


Yes + + 


Yes ■*■ + + + 


Antiemetic 


Yes + + + 


Not tested 


Yes + + + 


Hypothermia 


Yes + 


Yes + 


No 


Hypotension 


Yes + + 


Yes ♦ + + 


+ 


Parkinsonism 


Yes + * 


Yes + 


Yes + + + + 


Ant i adrenergic 


Yes + + 


Yes + + + 


+ 


Antichol inergic 


Yes + 


Yes + 


Negl igible 


Antihistaminic 


Yes + 


Negi igible 


Negl igible 


Releases NE . DA 


No 


No 


No 


Blocks DA 


Yes + + 


Yes + 


Yes + + + + 


Blocks NE 


Yes «■ + 


Yes + + + 


Yes + 


Central sympathetic 


Yes * + 


Yes + 


Yes + + + 



suppressant 



Chlorpromazine, thiothixene, and haloperidol decrease the functional availability of dopamine (DA) and norepinephrine (NE) by blocking the 
dopamine receptor arte* in the basal ganglia and norepinephrine receptor srtes in thalamic and hypothalamic areas. Reserpine simply reduces the 
concentration* of norepinephrine and dopamine in these areas. Both of these actions result in suppression of central sympathetic activity 
+ -»+-♦- + + indicates from very weak to very strong effects. 



Table III 

Comparative Pharmacology of Antipsychotics 



Extrapyramidal 
Drug 



Sedation 



Adrenergic 
Blockage 



Reaction 



Chlorpromazine 

Chlorprothixene 

Haloperidol 

Molindone 

Loxapine 



High 
High 
Low 

Moderate 
High 



Moderate to high 

High 

Low 

Moderate 

Low to moderate 



Moderate 

Low to moderate 

High 

Moderate to high 
High 



See Ebadi, PHARMACOLOGY , Little, Brown and Co., Boston, 61-65 
(1985); Cattabeni et al Adv. Biochem. Psychopharmacology 24:275 
(1980). Baldessarini , supra, which references are herein 
incorporated entirely by reference. 

However, despite the fact that thousands of neuroleptic - 
or ant insychot i c - t ype compounds have been synthesized and reported 
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Alternative to dopamine receptor GPRs, as presented 
above, other neuroreceptor GPRs are involved in neurological 
pathologies, and drugs such as neuroreceptor GPR binding agents, 
presently used for treating these pathologies, also suffer from 
5 similar side effects as those of neuroleptics, as presented above. 
Other GPRs are also involved in receptor- related 
pathologies, such as hormone related GPRs involved in endocrine 
related pathologies. 

Accordingly, there is a need to provide G- protein coupled 

10 receptor binding agents, including neuroreceptor and endocrine 
receptor GPRs, which do not produce such deleterious and 
debilitating side effects as those produced by known agents, such 
as neuroleptics, which can be used for therapy or diagnosis of GPR 
related pathologies. 

15 Citation of documents herein is not intended as an 

admission that any of the documents cited herein is pertinent 
prior art, or an admission that the cited documents are considered 
material to the patentabilty of the claims of the present 
application. All statements as to the date or representations as 

2 0 to the contents of these documents are based on the information 
available to the applicant and does not constitute any admission 
as to the correctness of the dates or contents of these documents . 

SUMMARY OF THE INVENTION 

It is therefore an object of the present invention to 

2 5 overcome one or more deficiencies found in the related art. 

It is another object of the present invention to provide 
non-naturally occurring synthetic, isolated and/o^ recombinant GPR 
polypeptides which are fragments, consensus fragments and/or 
sequences having conservative amino acid substitutions, of at 

3 0 least one transmembrane domain of at least one G-protein coupled 

receptor, which polypeptides have been discovered to have 
receptor- like functional binding sites of neuroreceptor and 
endocrine GPRs, such that GPR polypeptides of the present 
invention may bind GPR ligands, or which may also modulate, 
35 quantitatively or qualitatively, GPR ligand binding to GPRs. 
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It is still another object of the present invention to 
provide GPR polypeptides and compositions that have only partially 
helical structures, in contrast to known characterized 
transmembrane domains of GPRs, such as, but not limited to, GPR 
5 transmembrane domains I -VII. 

It is yet another object of the present invention to 
provide synthetic or recombinant GPR polypeptides, conservative 
substitution derivatives thereof , antibodies , anti - idiotype 
antibodies, compositions and methods that can be used as potential 

10 modulators of G-protein coupled receptor function, by binding to 
GPR ligands or modulate GPR ligand binding, due to their expected 
biological properties, which may be used in diagnostic, 
therapeutic and/ or research applications . 

It is a further ob j ect of the present invention is to 

15 provide synthetic, isolated or recombinant polypeptides which are 
designed to inhibit or mimic various GPRs or fragments thereof, as 
receptor types and subtypes . 

According to one aspect of the present invention, a 
synthetic or recombinant GPR polypeptide is provided that 

20 comprises a GPR amino acid sequence of, e.g., at least 5, 10, 15 
or 20 amino acids, substantially corresponding to at least one 
transmembrane domain , or fragment and/ or consensus peptide 
thereof, of a G-protein coupled receptor, wherein at least 20 
amino acids are preferred. In a preferred embodiment, the 

25 polypeptide is (a) chemically synthesized and/or (b) obtained from 
a recombinant host cell or organism which expresses a recombinant 
nucleic acid encoding a GPR polypeptide, as defined herein. 

In another preferred embodiment, the transmembrane domain 
is selected from at least one of TM1 , TM2 , TM3 , TM4 , TM5 , TM6 or 

30 TM7 , corresponding to transmembrane domains I, II, III, IV, V, VI 
and VII, respectively, of a GPR. In another preferred embodiment, 
the transmembrane domain is a dopamine receptor transmembrane 
domain selected from the group consisting of at least one of a D, , 
Tj Tj t ij 3.nd d cc am ine receiptor era n sm^mb rsnG d oma i r Th p- - 

preferred embodiment, zne ^PR pc_.ypept.iae ammo acid sequence 
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substantially corresponding to an amino acid sequence contained in 
at least one of Fig. 2 (SEQ ID N0:2), Fig. 3 (SEQ ID NO:3) or Fig. 
5 (SEQ ID NO: 5) . 

In another aspect of the present invention, a GPR 
5 composition is provided, comprising a GPR polypeptide, or a 
pharmaceutically acceptable ester, ether, sulfate, carbonate, 
malate, glucuronide or salt thereof, the composition further 
comprising a pharmaceutically acceptable carrier and/or diluent. 

In still another aspect of the present invention, a 

10 method is provided for treating a subject suffering from a disease 
state involving a qualitative or quantitative pathological 
abnormality of a GPR protein or a biological molecule functionally 
associated therewith. Such biological molecule may be a membrane 
cytoplasmic protein, lipid, carbohydrate, saccharide, nucleoside 

15 or nucleotide mono-, di-, or tri - phosphate , an enzyme, a co- 
factor, a nucleic acid, a neurotransmitter, an ion, a carrier, a 
cell receptor, or any combination thereof. 

In a preferred embodiment, the GPR protein is a dopamine 
receptor and the abnormality involves a dopamine related 

20 pathology, wherein the method comprises administering an effective 
dopamine receptor modulating amount of a GPR polypeptide of the 
present invention. In another preferred embodiment, the 
transmembrane domain is a D 2 dopamine receptor domain and the 
disease state is a psychiatric disorder, such as schizophrenia or 

25 schiz affective disorder (see American Psychiatric Association, 
Revised Manual of Diagnostic and Statistical Criteria for 
Psychiatric Disorders (DSM-III-R) , American Psychiatric Assoc. 
Press, Washington, DC (1989)). 

In another preferred embodiment, the GPR composition is 

3 0 administered as a pharmaceutical composition to provide a GPR 

polypeptide in an amount ranging from about 0.01 /xg to 100 mg/kg, 
and also preferably, about 10 /xg to 10 mg/kg. In another 
preferred embodiment, the administering is by oral, intravenous , 
intramuscular, parenteral or topical administration, including 

3 5 mucosal administration to the nasal mucosa or the oral mucosa, by 
aerosol, nebulizer or drop administration as non- limiting 
examples . 
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Other objects of the invention will be apparent to 
skilled practitioners from the following detailed description and 
examples relating to the present invention. 



5 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is the amino acid sequence of a control peptide 
(SEQ ID NO:l), which is hydrophobic in its properties, but does 
not correspond to a known GPR transmembrane domain. 

Fig. 2 represents the amino acid sequence of a GPR 
10 transmembrane polypeptide, polypeptide II (SEQ ID NO:2), which 

corresponds to a portion of the dopamine D : receptor transmembrane 
segment III. 

Fig. 3 represents the amino acid secruence of a 
transmembrane polypeptide, polypeptide III (SEQ ID NO:3), 
15 corresponding to a consensus peptide of the dopamine D 2 receptor 
transmembrane domains I -VII . 

Fig. 4 represents the amino acid sequence of a consensus 
sequence of transmembrane domains that is shortened to be less 
than the length required to span a lipid bilayer. 
2 0 Fig. 5 represents a consensus amino acid sequence of 

transmembrane domain as a consensus peptide between dopamine 
receptors D 1 and D 2 . 

Fig. 6 is a representation of a circular dichroism 
spectrum of a solution of the consensus polypeptide III (SEQ ID 
25 NO:3) of Fig. 3. 

Fig. 7 is a graphical representation of radioligand 
binding assay data comparing control polypeptide II (SEQ ID NO:l) 
of Fig . 1 , labeled as "11" and consensus polypeptide I (SEQ ID 
NO: 3) of Fig. 3, labeled as "I" . 
30 Fig. 8A-G are a comparison listing of amino acid 

sequences of transmembrane domains and adjacent amino acid 
sequences of representative GPRs (SEQ ID NOS:6-79). 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention relates to G-protein coupled 
receptor (GPR) polypeptides which can be used to mimic naturally 
occurring or isolated GPRs, or to modulate the binding of GPR 
5 ligands to GPRs , such as inhibition or enhancement of binding. 
GPR polypeptides of the present invention can include GPR 
transmembrane domain fragments and/or consensus peptides thereof, 
of at least 4-10 amino acids in length, and/or corresponding 
sequences having conservative amino acid substitutions as 

10 "substitution peptides", wherein the GPR polypeptide binds a GPR 
ligand or modulates the binding of a GPR ligand to a GPR in vitro, 
in vivo or in situ. 

GPR polypeptides of the present invention can be 
synthesized or recombinantly produced, or optionally purified, to 

15 provide commercially useful amounts of GPR polypeptides for use in 
therapeutic, diagnostic or research applications, according to 
known method steps, see, e.g., Ausubel et al, eds. Current 
Protocols in Mo lecular Biology . Wiley Interscience , N.Y., (1987, 
1992); and Sambrook et al , Molecular Cloning , A Laboratory Manual . 

20 2nd edition, Vols. 1-3, Cold Spring Harbor Press, (1989), which 
references are herein entirely incorporated by reference. 

Additionally, GPR polypeptides according to the present 
invention can be used to generate polyclonal and/or monoclonal 
antibodies, anti - idiotype antibodies thereto, or fragments 

25 thereof, which may used for diagnostic and/or therapeutic 

applications, according to known method steps, see, e.g., Harlow 
and Lane, Antibod ies: A Laboratory Manual , Cold Spring Harbor 
Press (1988), which is herein entirely incorporated by reference. 

GPR polypeptides, anti-GPR antibodies or anti - idiotype 

3 0 antibodies (or fragments thereof) to GPR polypeptides have been 
unexpectedly discovered to quantitatively or qualitatively 
modulate G-protein coupled receptors, such that binding of GPR 
polypeptides or anti - idiotype antibodies (or fragments thereof) to 
G-protein coupled receptor ligands may be used for diagnostic 

35 research or therapeutic applications of the present invention. 
Such GPR polypeptides, antibodies or anti - idiotype antibodies of 
the present invention may therefore be used as modulators of 
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G-protein coupled receptors, such as neuroreceptors or endocrine 
receptors, as non- limiting examples. 

Binding of such GPR polypeptides, (including GPR 
fragments, consensus peptides, substitution derivatives and anti- 
5 idiotype antibody fragments) of the present invention may be used 
to treat symptoms of, and provide diagnosis and treatment for, 
pathologies related to GPRs . Such pathologies have been found to 
correlate with symptoms occurring in neurological, viral or 
endocrine pathologies. D 2 receptor- related psychotic disorders, 

10 including schizophrenia, now treated with neuroleptics, is a non- 
limiting example thereof . 

The use of synthetic or recombinant GPR polypeptides of 
the present invention can be preferable to the use of known drugs 
that bind G-protein coupled receptors, such as neuroleptics that 

15 bind or inhibit the biological effect of binding to neuroreceptors 
as a non-limiting example. Such polypeptides are expected to have 
significantly less side effects than presently used drugs 
presently used for inhibiting such receptor binding including 
neuroleptics, as they would structurally mimic naturally occuring 

2 0 GPRs and/or modulate ligand binding. Thus, GPR polypeptides are 
expected to have reduced side effects attributable to known 
foreign compound drugs, with less immunogenicity , and reduced 
potential for motoric side effects (e.g., extrapyramidal symptoms 
and/or tardive dyskinesia) . 

25 The present invention is also related to the production, 

by chemical synthesis or recombinant DNA technology, of GPR 
polypeptides, preferably as small as possible while still 
retaining sufficiently high affinity or interaction with G-protein 
coupled receptors to modulate, such as to inhibit or to enhance, 

30 binding to such receptors by GPR ligands. 

GPR polypeptides of the present invention may include 5- 
10 to 50-150 amino acid fragments , consensus sequences or 
substitution sequences of GPRs, e.g., as presented in Fig. 8A-G 



receptors, serotonin receptors ;5-H7;, histamine H2 receptors, 
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10 



thrombin receptors, kinin receptors, follicle stimulating hormone 
receptors, opsins and rhodopsins, odorant receptors, 
cytomegalovirus GPRs , adenosine A2 receptors, dopamine receptor, 
histamine H2 receptors, octopanmine receptors, N-formyl receptors, 
anaphylatoxin receptors, thromboxane receptors, IL-8 receptors, 
platelet activating factor receptors, endothelin receptors, 
bombesin gastrin releasing peptide receptor, neuromedin B 
preferring bombesin receptors, vasoactive intestinal peptides, 
neurotensin receptors, bradykinin receptors, thyrotropin- releasing 
hormone receptors, substance P receptors, neuromedin K receptors, 
adrenal angiotensen II type I receptors, mas oncogene 
(angiotensin) receptors lutropin- choriogonadotropin receptors, 
thyrotropin receptors, follicle stimulating hormone receptors, 
cannabinoid receptors, glucocorticoid- induced receptors, 
15 endothelial cell GPRs, testis GPRs, and thoracic aorta GPRs, and 
homologs thereof having a homology of at least 80°; with at least 
one of transmembrane domains 1-7, as described herein. See, e.g., 
Probst et al DNA and Cell Biology 11:1-20(1992), which is entirely 
incorporated herein by reference. 

Accordingly, a "G-protein coupled receptor polypeptide" 
or "GPR polypeptide" of the present invention includes 
polypeptides having a "GPR amino acid sequence" which 
substantially corresponds to at least one 10 to 50 amino acid 
fragment and/or consensus sequence of a known GPR or group of 
GPRs, wherein the GPR polypeptide has homology of at least 8 0%, 
such as 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 98, 99 or 100% homology, while maintaining GPR 
modulating activity, wherein a GPR polypeptide of the present 
invention is not naturally occurring or is naturally occurring but 
is in a purified or isolated form which does nor occur in nature. 
Preferably, a GPR polypeptide of the present invention 
substantially corresponds to a transmembrane domain of a GPR or 
group of GPRs as a consensus sequence. 

Also preferred are GPR polypeptides wherein the GPR amino 
acid sequence is 4-10 to 50 amino acids in length, such as 5, 6, 
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, ^0, 21, 22, 23, 
24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 



20 



25 



30 



35 
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40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 
110, 120, 130, 140 or 150 amino acids , or any range therein . 

An amino acid or nucleic acid sequence of a GPR 
polypeptide of the present invention is said to "substantially 
5 correspond" to another amino acid or nucleic acid sequence, 

respectively, if the sequence of amino acids or nucleic acid in 
both molecules provides polypeptides having biological activity 
that is substantially similar, qualitatively or quantitatively, to 
the corresponding fragment of at least one GPR transmembrane 

10 domain, or which may be synergistic when two or more transmembrane 
domains, consensus sequences or homologs thereof are present. 

Additionally or alternatively, such "substantially 
corresponding" sequences of GPR polypeptides include conservative 
amino acid or nucleotide substitutions, or degenerate nucleotide 

15 codon substitutions wherein individual amino acid or nucleotide 
substitutions are well known in the art. 

Alternatively or additionally, substantially 
corresponding refers to GPR polypeptides having amino acid 
sequences having at least 80% homology or identity to an amino 

20 acid sequence of SEQ ID NO:l, such as 80, 81, 82, 83, 84, 85, 86, 
87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% 
homology or identity . 

Accordingly, GPR polypeptides of the present invention, 
or nucleic acid encoding therefor, include a finite set of 

25 substantially corresponding sequences as substitution peptides or 
polynucleotides which can be routinely obtained by one of ordinary 
skill in the art , without undue experimentation, based on the 
teachings and guidance presented herein. For a detailed 
description of protein chemistry and structure, see Schulz, G.E. 

30 et al . , Principles of Protein Structure, Springer- Verlag , New 
York, 1978, and Creighton, T.E., Proteins: Structure and 
Molecular Properties, W.H. Freeman & Co., San Francisco, 1983, 
which are hereby incorporated by reference . For a presentation of 
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Conservative substitutions of a GPR polypeptide of the 
present invention includes a variant wherein at least one amino 
acid residue in the polypeptide has been conservatively replaced 
by a different amino acid. Such substitutions preferably are made 
5 in accordance with the following list as presented in Table IV, 
which substitutions may be determined by routine experimentation 
to provide modified structural and functional properties of a 
synthesized polypeptide molecule, while maintaining the receptor 
binding, inhibiting or mimicking biological activity, as 
10 determined by known GPR receptor activity assays. 



Table IV 



Oriq-inal 


Exemplary 


Residue 


Substitution 


Ala 


Gly; Ser 


Arg 


Lys 


Asn 


Gin; His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Ala; Pro 


His 


Asn; Gin 


He 


Leu; Val 


Leu 


Ile;Val 


Lys 


Arg; Gin; Glu 


Met 


Leu; Tyr; He 


Phe 


Met ; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp ; Phe 


Val 


He; Leu 



Alternatively, another group of substitutions of GPR 
polypeptides of the present invention are those in which at least one 
amino acid residue in the protein molecule has been removed and a 
different residue inserted in its place according to the following 
5 Table V. The types of substitutions which may be made in the 
protein or peptide molecule of the present invention may be based on 
analysis of the frequencies of amino acid changes between a 
homologous protein of different species, such as those presented in 
Table 1-2 of Schulz et al . , supra and Figs. 3-9 of Creighton, supra . 
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Based on such an analysis, alternative conservative substitutions are 
defined herein as exchanges within one of the following five groups: 

TABLE V 

1. Small aliphatic, nonpolar or slightly polar residues: Ala Ser 
Thr (Pro, Gly) ; 

2. Polar, negatively charged residues and their amides: Asp, Asn 
Glu, Gin; 

3. Polar, positively charged residues: 
His , Arg, Lys ; 

4. Large aliphatic, nonpolar residues: 
Met, Leu, lie, Val (Cys) ; and 

5 . Large aromatic residues : Phe , Tyr , Trp . 

The three amino acid residues in parentheses above have 
special roles in protein architecture. Gly is the only residue 
lacking any side chain and thus imparts flexibility to the chain. 
This however tends to promote the formation of secondary structure 
5 other than Of- helical. Pro, because of its unusual geometry, tightly 
constrains the chain. It generally tends to promote 0- turn- like 
structures, although in some cases Cys can be capable of 
participating in disulfide bond formation which is important in 
protein folding. Note the Schulz et al . would merge Groups 1 and 2, 

10 above. Note also that Tyr, because of its hydrogen bonding 
potential, has significant kinship with Ser, and Thr, etc. 

Conservative amino acid substitutions according to the 
present invention, e.g., as presented above, are known in the art and 
would be expected to maintain biological and structural properties 

15 of the polypeptide after amino acid substitution. Most deletions and 
insertions, and substitutions according to the present invention are 
those which do not produce radical changes in the characteristics of 
the protein or peptide molecule. "Characteristics" is defined in a 
non- inclusive manner to define both changes in secondary structure, 

20 e.g. a-helix or /3-sheet, as well as changes in physiological 
activity, e.g. in receptor binding assays. 

However, when the exact effect of the substitution, 
deletion, or insertion is to be confirmed one skilled in the art will 
appreciate that the effect of the substitution or substitutions will 

^ ho ,'2 1 ll^^-n.^ Kif -v- ~ - - - >- ~ .— , -v- -, ^ ^ ^ -,. ... _ J 1_ _ _ 
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typically is made by sice- specif ic mutagenesis of the peptide 
molecule- encoding nucleic acid, expression of the mutant nucleic acid 
in recombinant cell culture, and, optionally, purification from the 
cell culture, for example, by immunoaf f inity chromatography using a 
5 specific antibody on a chemically derivatized column or immobilized 
membranes or hollow fibers (to absorb the mutant by binding to at 
least one epitope) . 

A preferred use of this invention is the production, by 
chemical or recombinant DNA technology, of GPR polypeptides, 
10 preferably as small as possible while still retaining sufficiently 
high affinity for binding to, or association with, GPRs. By 
production of GPR polypeptides including smaller fragments or 
variants of such transmembrane domains, one skilled in the art, using 
known binding and inhibition assays, can readily identify the GPR 
15 polypeptides capable of binding minimizing or modulating G-protein 
coupled receptors using known methods. Non-limiting examples of 

fragments of GPRs to be used as GPR polypeptides or as a basis for 
consensus sequences thereof for GPR polypeptides, are presented in 
Figs. 2-5 and Fig. 8A-G, wherein fragments or consensus sequences of 
20 10 to 50 amino acids of at least one sequence of Figs. 2-5 or 
corresponding to at least one transmembrane domain or domains 1-7 
listed in Fig. 8A-G (SEQ ID NOS:6-79) are encompassed by the present 
invention, such as at least one transmembrane domain of one or more 
GPRs, such as a cAMP receptor (l), adenosine receptors (2-3); 
25 muscarinic acetylcholine receptors (4-8) ; human adrenergic receptors 
(9-11, 14-16, 19-25, 28); adrenergic receptors (9-28); human thrombin 
receptor (31); endothelin receptors (35-36), bombesin receptors (37- 
38), endocrine receptors (48-50), rhodopsin (51). opsins (52-54), 
odorant receptors (55-64), and cytomegalovirus GPRs (72-54), as non- 
30 limiting examples, wherein (••#") refers to the listed sequences in 
Fig. 8A-G. 

Accordingly, GPR polypeptides may include consensus 
sequences and/or fragments of at least one of transmembrane domain 
1-7 of one or more GPRs as presented in Figs. 2-5 (SEQ ID NO:2-5) or 
35 Fig. 8A-G. (SEQ ID NOS:6-79) or homologs thereof, which GPR 
polypeptides do not occur naturally, and/or which are provided in an 
isolated and/or purified form not found in nature. 
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Consensus peptides of GPR polypeptides of the present 
invention may include peptides which are distinct from known GPR 
sequences in critical structural features, but which are derived from 
consensus sequences of homologous GPR transmembrane domains 1-7, 
e.g., as presented in Fig. 8A-G (SEQ ID NOS:6-79). Such consensus 
peptides may be derived by molecular modeling, optionally combined 
with hydrophobicity analysis and/or fitting to model helices, as 
non- limiting examples. Such modeling can be accomplished according 
to known method steps using known modeling algorithms, such as, but 
not limited to, ECEPP, INSIGHT, DISCOVER, CHEM- DRAW, AMBER, FRODO and 
CHEM-X. Such algorithms compare transmembrane domains between 
related G-protein coupled receptors, determine probable energy - 
miminized structures and define alternative consensus polypeptide 
fragments . 

Such consensus peptides or fragments of GPRs may then be 
synthesized or produced recombinant ly, in order to provide GPR 
polypeptides according to the present invention which mimic, modulate 
or inhibit binding of ligands to G-protein coupled receptors. GPR 
ligands, in the context of the present invention, refer to biological 
molecules that bind GPRs in vitro, in situ or in vivo, and may 
include hormones, neurotransmitters, viruses or receptor binding 
domains, thereof, opsins, rhodopsins, nucleosides, nucleotides, 
coagulation cascade factors, odorants or pheremones, toxins, colony 
stimulating factors, platelet activating factors, neuroactive 
peptides, neurohumors , or any biologically active compounds, such as 
drugs or synthetic or naturally occurring compounds. 

The following non- limiting examples of consensus peptides 
of GPRs of the present invention are provided by way of guidance and 
not by way of limitation. In GPR polypeptides of the present 
invention, one or more, preferably 4-10, Asp and/or Lys residues may 
additionally be incorporated at the carboxy and/or amino terminal 
ends in order to provide expected helix forming effects of the helix 
dipole effect, e.g., as described in Baldwin et al Biochew. 28:2130 
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As a non- limiting example of GPR polypeptide of the present 
invention, dopamine receptor transmembrane fragments of D. 
transmembrane domain (e.g., domain III) as presented in Fig. 2 (SEQ 
ID NO: 2) or a consensus sequence as presented in Fig. 3 (SEQ ID 
5 NO:3), e.g., of D 2 domains I-VII. Additionally or alternatively a 
consensus sequence may include less than 20 amino acids, such as 15 
amino acids corresponding to a transmembrane domain, such as a D, 
receptor domain, as presented in Fig. 4 (SEQ ID NO : 4 ) as polypeptide 
IV, which is smaller than the length required by spanning an average 

10 lipid bilayer of a cell membrane. 

However, in the context of the present invention, GPR 
polypeptides of greater than 15 -20 amino acids are preferred such 
that the GPR polypeptides are able to span the lipid bilayer. 

Another non- limiting example of a GPR polypeptide using 

15 dopamine receptor transmembrane domains is a consensus sequence of 
two or more GPR receptors, such as the dopamine B l and D 2 receptors. 
A non- limiting example of such a consensus GPR polypeptide is 
presented in Fig. 5 {SEQ ID NO:5). 

Additionally, modified amino acids or chemical derivatives 

20 of amino acids of consensus or fragments of GPRs proteins, according 
to the present invention may be provided, which polypeptides contain 
additional chemical moieties or modified amino acids not normally a 
part of the protein. Covalent modifications of the peptide are thus 
included within the scope of the present invention. Such 

25 modifications may be introduced into a GPR polypeptide by reacting 
targeted amino acid residues of the polypeptide with an organic 
derivatizing agent that is capable of reacting with selected side 
chains or terminal residues. The following examples of chemical 
derivatives are provided by way of illustration and not by way of 

30 limitation. 

Aromatic amino acids may be replaced with D- or 
L-naphylalanine, D- or L- Phenylglycine , D- or L-2-thieneylalanine, 
D- or L-1-, 2-, 3- or 4 -pyreneylalanine , D- or L- 3 - thieneylalanine , 
D- or L- (2-pyridinyl) -alanine, D- or L- ( 3 -pyridine 1 ) - alanine , D- or 

35 L- (2 -pyrazinyl) -alanine, D- or L- (4 - isopropyl) -phenylglycine, 
D- (trif luoromethyl) -phenylglycine, D- ( trif luoromethyl ) -phenylalanine, 
D-p-f luorophenylalanine, D- or L-p-biphenylphenylalanine , D- or 
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L-p-methoxybiphenylphenylalanine , D - or L- 2 - indole (alkyl ) alanines , 
and D- or L - alkylainines where alkyl may be substituted or 
unsubstituted methyl, ethyl, propyl, hexyl , butyl, pentyl , iso- 
propyl , iso-butyl, sec-isotyl, iso-pentyl, non-acidic amino acids, 
5 of C1-C20. 

Acidic amino acids can be substituted with non- carboxylate 
amino acids while maintaining a negative charge, and derivatives or 
analogs thereof, such as the non- limiting examples of (phosphono) - 
alanine, glycine, leucine, isoleucine, threonine, or serine; or 

10 sulfated (e.g., -S0 3 H) threonine, serine, tyrosine. 

Other substitutions may include unnatural hyroxylated amino 
acids may made by combining "alkyl" (as defined and exemplified 
herein) with any natural amino acid. Basic amino acids may be 
substituted with alkyl groups at any position of the naturally 

15 occurring amino acids lysine, arginine, ornithine, citrulline, or 
(guanidino) -acetic acid, or other (guanidino) alkyl - acetic acids, 
where "alkyl " is define as above . Nitrile derivatives (e.g. , 
containing the CN- moiety in place of COOH) may also be substituted 
for asparagine or glutamine , and methionine sulfoxide may be 

20 substituted for methionine. Methods of preparation of such peptide 
derivatives are well known to one skilled in the art . 

In addition, any amide linkage in any of the GPR 
polypeptides can be replaced by a ketomethylene moiety, e.g. (-C(=0) - 
CH 2 - ) for ( - (C=0) -NH- ) . Such derivatives are expected to have the 

2 5 property of increased stability to degradation by enzymes , and 

therefore possess advantages for the formulation of compounds which 
may have increased in vivo half lives, as administered by oral, 
intravenous , intramuscular, intraperitoneal , topical , rectal , 
intraocular, or other routes . 

3 0 In addition, any amino acid representing a component of the 

said peptides can be replaced by the same amino acid but of the 
opposite chirality. Thus, any amino acid naturally occurring in the 
L- configuration (which may also be referred to as the R or S, 



which can additionally be referred tc as the R- or :ne S- , dependmc 
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upon its composition and chemical configuration. Such derivatives 
have the property of greatly increased stability to degradation by 
enzymes, and therefore are advantageous in the formulation of 
compounds which may have longer in vivo half lives, when 
5 administered by oral, intravenous, intramuscular, intraperitoneal, 
topical, rectal, intraocular, or other routes. 

Additional amino acid modifications of amino acids of GPR 
polypeptides of to the present invention may include the following: 
Cysteinyl residues may be reacted with alpha-haloacetates {and 

10 corresponding amines) , such as 2 - chloroacetic acid or 
chloroacetamide, to give carboxymethyl or carboxyamidomethyl 
derivatives. Cysteinyl residues may also be derivatized by reaction 
with compounds such as bromotrif luoroacetone, alpha-bromo- 
beta- (5 -imidozoyl) propionic acid, chloroacetyl phosphate, 

15 N-alkylmaleimides, 3 -nitro-2-pyridyl disulfide, methyl 2-pyridyl 
disul f ide , p- chloromercuribenzoate , 2 - chloromercuri - 4 - nitrophenol , 
or chloro- 7 - nitrobenzo-2 - oxa- 1 , 3 -diazole . 

Histidyl residues may be derivatized by reaction with 
compounds such as diethylprocarbonate e.g., at pH 5.5-7.0 because 

20 this agent is relatively specific for the histidyl side chain, and 
para-bromophenacyl bromide may also be used; e.g. , where the reaction 
is preferably performed in 0.1 M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues may be reacted with 
compounds such as succinic or other carboxylic acid anhydrides. 

25 Derivatization with these agents is expected to have the effect of 
reversing the charge of the lysinyl residues. Other suitable 
reagents for derivatizing alpha-amino- containing residues include 
compounds such as imidoesters/e . g . , as methyl picolinimidate ; 
pyridoxal phosphate ; pyridoxal ; chl o rob or ohydr ide ; 

30 trinitrobenzenesulf onic acid; O-methylisourea ; 2,4 pentanedione ; and 
transaminase- catalyzed reaction with glyoxylate. 

Arginyl residues may be modified by reaction with one or 
several conventional reagents, among them phenylglyoxal , 
2 , 3 -butanedione, 1 , 2 - cyclohexanedione , and ninhydrin according to 

35 known method steps. Derivatization of arginine residues requires 
that the reaction be performed in alkaline conditions because of the 
high pKa of the guanidine functional group. Furthermore, these 
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reagents may react with the groups of lysine as well as the arginine 
epsi Ion- amino group . 

The specific modification of tyrosyl residues per se is 
well-known, such as for introducing spectral labels into tyrosyl 
5 residues by reaction with aromatic diazonium compounds or 
tetranitromethane . N-acetylimidizol and tet rani tromethane may be 
used to form 0-acetyl tyrosyl species and 3-nitro derivatives, 
respectively . 

Carboxyl side groups (aspartyl or glutamyl) may be selec- 

10 tively modified by reaction with carbodiimides ( R 1 N-C-N-R 1 ) such as 
l-cyclohexyl-3- (2 -morpholinyl - (4-ethyl) carbodiimide or 1- 
ethyl- 3 - (4 -azonia-4 , 4 - dimethylpentyl ) carbodiimide . Furthermore, 
aspartyl and glutamyl residues may be converted to asparaginyl and 
glutaminyl residues by reaction with ammonium ions. 

15 Glutaminyl and asparaginyl residues may be frequently 

deamidated to the corresponding glutamyl and aspartyl residues. 
Alternatively, these residues may be deamidated under mildly acidic 
conditions. Either form of these residues falls within the scope of 
the present invention . 

2 0 Derivatization with bif unctional agents is useful for 

cross -linking the peptide to a water- insoluble support matrix or to 
other macromolecular carriers, according to known method steps. 
Commonly used cross - 1 inking agents include, e.g., 
1, l-bis (diazoacetyl) -2-phenylethane, glutaraldehyde, 

25 N-hydroxysuccinimide esters, for example, esters with 
4 -azidosalicylic acid, homobif unctional imidoesters , including 
disuccinimidyl esters such as 3,3'- 
dithiobis ( succinimidylpropionate) , and bifunctional maleimides such 
as bis-N-maleimido- 1, 8-octane. Derivatizing agents such as 

30 methyl -3- [ (p- azidophenyl ) dithio] propioimidate yield photoact ivatable 
intermediates that are capable of forming crosslinks in the presence 
of light. Alternatively, reactive water- insoluble matrices such as 
cyanogen bromide- activated carbohydrates and the reactive substrates 
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Other modifications of GPR polypeptides of the present 
invention may include hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of seryl or threonyl residues, 
methylation of the alpha- amino groups of lysine, arginine, and 
5 histidine side chains (T.E. Creighton, Proteins: Structure and 
Molecule Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 
(1983)), acetylation of the N-terminal amine, methylation of main 
chain amide residues (or substitution with N-methyl amino acids) and, 
in some instances, amidation of the C- terminal carboxyl groups, 

10 according to known method steps. 

Such derivatized moieties may improve the solubility, 
absorption, permeability across the blood brain barrier biological 
half life, and the like. Such moieties or modifications of GPR 
polypeptides may alternatively eliminate or attenuate any possible 

15 undesirable side effect of the protein and the like. Moieties 
capable of mediating such effects are disclosed for example, in 
Remington' s Pharmaceutical Sciences, 16th ed. , Mack Publishing Co., 
Easton, PA (1980) . 

Such chemical derivatives of GPR polypeptides also may 

20 provide attachment to solid supports, including but not limited to, 
agarose, cellulose, hollow fibers, or other polymeric carbohydrates 
such as agarose, cellulose, such as for purification, generation of 
antibodies or cloning; or to provide altered physical properties, 
such as resistance to enzymatic degradation or increased binding 

25 affinity or modulation for GPRs, which is desired for therapeutic 
compositions comprising GPR polypeptides, antibodies thereto or 
fragments thereof. Such peptide derivatives are well-known in the 
art, as well as method steps for making such derivatives using 
carbodiimides active esters of N- hydroxy succinimmide , or mixed 

3 0 anhydrides, as non- limiting examples. 

Variation upon consensus peptide sequences of GPR 
polypeptide of the present invention may also include: the addition 
of one, two, three, four, or five lysine, arginine or other basic 
residues added to the -C00H terminal end of the peptide; and/or one, 

35 two, three, four, or five glutamate or aspartate or other acidic 
residues added to the amino terminal end of the peptide, where 
"acidic" and "basic" are as defined herein. Such modifications are 
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well known to increase the a- helical content of the peptide by the 
"helix dipole effect". They also can provide enhanced aqueous 
solubility of the peptide. See, e.g., Baldwin et al . , supra 

As another non- limiting example of a GPR polypeptide of the 
5 present invention, serotonergic receptors (5-HT) consensus sequences 
may be determined using presently known 5-HT sequences and include, 
e.g., as consensus peptides of TM3 , TM5 and TM7 , ■ espect ively : 

5-HT consensus (1) DDDDNIWSIFDWIGYLNSISMVIYTLFKKKK (SEQ ID NO: 80) 
5-HT consensus (2) DDDDNIWNIFSTIGYLNSISPVSVIMHIYGKKKK {SEQ ID NO: 81) 
10 5-HT consensus (3) DDDDGYS IYDTLVTFAINPVYITVFKKKK (SEQ ID NO: 82) 

Such non- naturally occurring consensus sequences may also 
be further modified according to known method steps to provide 
additional consensus peptides with substituted amino acids to 
increase or decrease a-helical propensity and/or solubility (e.g., 
15 hydrophilicity) . As a non-limiting example, 5-HT consensus peptide 
(1) above may be modified according to the present invention to have 
increase helical propensity and increased aqueous solubility as 
follows : 

5-HT consensus (4) DDDDNAWS AFD WALYLNS I SMAI YTYAKKKK (SEQ ID NO: 83) , 

20 wherein, e.g., smaller, non-polar residues replace either larger, 
more polar residues (e.g., Ala for lie or Val) or larger aromatic 
residues (e.g., Ala for Phe) . 

Another non- limiting , illustrative example of consensus GPR 

polypeptides of the present invention are those for adrenergic 
25 receptors, are the following: 

An example of the consensus GPR polypeptide for 
domain VII across all presently known adrenergic receptors is as 
follows : 

adrenergic consensus (i) LFSFI TWL GYANS SLNP 1 1 YTT F ( SEQ ID NO: 84) 
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adrenergic consensus (2) VYTIYS SS VVFFAPSLAIMVITYT (SEQ ID NO: 85) 

Examples of a consensus GPR polypeptide for 
domain III across all adrenergic receptors are as follows: 

adrenergic consensus (3) IWLTSDIMSTSSILHNLCVISF (SEQ ID NO: 8b) 



5 An example of a consensus GPR polypeptide for domains III, 

V, and VII of all adrenergic receptors is as follows: 

adrenergic consensus (4) IWSIFSSDIWGYANHSSLAIMCPIVIYTV {SEQ ID NO:87) 

adrenergic consensus (5) IFTIFSSDIAVGYANHSSAAIMPIVIYSV (SEQ ID NO:88), 
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Wherein variations and substitutions of amino acids may be made as 
described herein. 

Non- limiting examples of consensus GPR polypeptides for 
transmembrane domain III across several or many, such as 1-500, or 
any range or value therein, G-protein receptors are as follows: 



YAI FVLYASAWLS FLNC P FI VTLNI (SEQ ID NO: 96) 



TM3- 


(1) 


TM3 - 


(2} 


TM3- 


(3) 


TM3- 


(4) 


TM3- 


(5) 


TM3- 


(6) 


TM3- 


(7) 


TM3- 


(8) 


TM3- 


(9) 


TM3- 


(10 


TM3 - 


(11 


TM3- 


(12 


TM3- 


(13 


TM3 - 


(14 


TM3- 


(15 


TM3- 


(16 


TM3- 


(17 


TM3- 


(18 


TM3 - 


(19 


TM3 - 


(20 


TM3 - 


(21 



WAIFVLYASAWLTFLNCPFIVTLNI (SEQ ID NO: 103) 
YAVFVLYASAWLSFLNMPFI VTLNI (SEQ ID NO: 104) 
YAVFVLYATAWLSFLNMPFIVTLNI (SEQ ID NO: 105) 
Y AVFVL Y AT AWLT F LNMP F I VTLN I (SEQ ID NO:106) 
YAVFVL YAS AV7LTFLNMP F I^vTLNI ( SEQ ID NO ; 107) 
YAI FVLYASAWLS FLNCVTAS I PFI VTLN I (SEQ ID N0:108) 
YAI FVLYASAWLS FLNCTSS I WTAS I VTLNI (SEQ ID NO: 109) 
YAIFVLYASAWLSFLNVTLNICTSSIV (SEQ ID NO: 110) 
YAIFVTjYASAWLSFLNTASILNLMFIVTLNI (SEQ ID NO:lll) 
YAI FVLYASAWLS FLNMAS I LNLPF I VTLN I (SEQ ID NO:112) 
YAI FVLYASAWLS FLNSGILLLAPFI VTLNI (SEQ ID NO: 113) 
YAI FVLYASAWLS FLNMSG I LLLAPFI VTLNI (SEQ ID NO: 114) 
YAI FVLYASAWLfFLNSELSVYTLTVCPFI VTLNI (SEQ ID NO: 115) 
YAI FVLYASAWLS FLNMSELSVYTLTVPF I VTLNI (SEQ ID N0:116) 
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TM3 - 


(7 7) 


V A T TTT 7T V 7\ C Tv WT 7\ O C T C 1 rV^PT rC T?T WO O T7 TT PTT MT / CUPi m mo -? -i -7 \ 

x AX c V 1» iioAnXaAohLo v x 1 Xj 1 Vb r UNUfr 1V1 UN X v bfc.y ID NO ; 1 1 7 J 




TM3 - 


(23 ) 


YAT FVT.VA Q AWT ACFT CT7VTT T^fDFTTTTT MT ( C CO TT\ MO • 1 1 Q \ 
x j-ix r viji -Ho/in i_ii-Loxii-to vz iiii vrr l VI luxr X I X Li insj ; 1 1 o ] 




TM3 - 


( 24 ) 


Y AT FVT.Y A C AWT err ACFT CT7Y A CFT C CTT r T w r r \7~t<rMlDX?T\7 T TT kit / crri m wr-\ t t o \ 
x >ix r v iji>o>injjo r v ik^lLoo 1L1 1 VINrlir r 1 vl UN X { bx^y ID WU : liy J 




TM3 - 


(25 ) 


YATFVT/Y"ACAWT.CFT.Mf^r:FT aT.WQT.rDFTVTT.NT f cpn Tn wn . i *)ni 
r v xj X rU/4n LiOr XjX^vTvjEj X.in_Lj ri o Xj^_r It X. V X XjXx X \ o XxJ liU . 1 Z U J 


5 


TM3 - 


(26 ) 


YATFVT.Y21Q AWT.C CT.MflfSFT AT WQT TTTT'T M T i CFO TTl MH- 1 Ol \ 




TM3 - 


(27) 


YATF17T.YAQ AWT jfVSFT AT WCT MODFTTrTT MT f CFO TTl MD • 1 O 1 
x j-tx r vl x >wri n Xjvjvj d Xj-ixj r% o XjXX V- tr r X v X XjXt X i oHi*^ XX/ iMU . 1 z Z / 




TM3 - 


(28) 


Y AT F\TT .V AfV2F T AT WOT ODT MO'DCTTTTT WT i Cffl Tfl M/^ • T *5 "3 \ 

x>*.x r vxgi>VjoliX-fiXjwoljor iiiNL-irr X V I ixWl I obl^ X-U xx\J.LsL6) 




TM3 - 


(29 ) 


YATFVT^YA^AWT CyyuT t rnVT f?.UT?T T Kfr*DT7 TTTT'T MT / C I?0 y Pi Tsir^ . i 0 4 ^ 




TM3 - 


(30) 


YAT T7\7T.VA Q AWT "CCITT T f (7VT rwpT t pT^TTri'! MT / Cl?0 m KTO . "1 O C \ 
x Hi r vXj x>\o>LrV i_ir r r XjXjr \j X XAjIN r XjXjirr JL V 1 XxIN X v^&W I J.^D ) 
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TM3 - 


( 31 ) 


YAT PVT.YAQ AWT.C jrT.WPAPTTVTTA TT A QT r'TTTTP T AT TX>"F TT7 T T*T MT ( Cun Tn ma . n £ \ 




TM3 - 


( 32 ) 


X +^~L r V Xj X >^OH HLii >\\_ r" X VJ-LX liioliLr X X IS X-HXj X f c X V X XuM 1 ^ uCy X D Li\J , X 1 ) 




TM3 - 


(33 ) 


YAIFVXtYATACFYVATTAOT.r'FTTFT AT.T ^TTT.Nr'PFTVTT.NT ( CFD TT) WD • 1 9A\ 




TM3 - 


(34 ) 


YAITACFYVATTAST.PFTTETATjTA^AWT^PT^NPPFTVTT^NT f Qrn TFl MO • i 0Q1 




TM3 - 


(35) 


YAIFVL.YATACFYVATTTFT AT.T ^ AWT.^FT.MPPFTVTT.MT f QFH Tfl mo • 1 ^ n ^ 
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TM3 - 


(36) 


YATFVT.YA^ AWT.QFT.fJAr*FYTr , T FAn\7r*FT TDFTITTTMT / CDH TTl MPi • i Oi ) 




TM3 - 


(37) 


YATFVT.YAC AWMAr'FVTP'T FAm/MTTT TT CFT WPDT7TT7TT MT ^ OT?n Tn XTr\ - t "a o ^ 




TM3 - 


( ** ft ) 


Irtlf VXjX r x XL>ljr A(jVL.r LxioAWLoriiNLrr IvIlilNi loa^ ID NO : 13 3 ) 




TM3 - 




VAT FtTT V7\ C!fn 7V"^ rXTM L.** T ' O TiTJT O CT MPnPTtFPT MT / O T7/~\ rn xt/™\ . t n >t \ 

x-Rxr VLiAi VUAVNJlr I bAWLtbr IxWV-PFX VTJaxNX t ID NO:134; 




TrVfi - 


v 4 u ; 


x Air b VDAVNx^^rvxjiASAWIjSF ID NO: 135) 
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TM"5 - 




x Air VXjX AbAWLb VDAVlMJlFTSF LiNCPFI VTXxNx ( SEQ ID NO: 136) 






i a o ^ 
l 4 ^ ; 


xAxi* VIjYASAWIjSFx^SVDAVNJIFTPFI VT ID NO: 137) 




XXT1J 


1 4 .3 ; 


xAlr VijXAisAWljbr IxNCPFI VSVDAVNPlx^l IIjNI ( SEQ ID NO: 138) 




X X\LO 


I A A \ 


xAx* vxji AbAWijbVDMjT 1 SFl^CPFIVTLiNI (SEQ ID NO: 13 9) 




X 1. XO 


V 4 O / 


x Axo VDAVNmr x r vLiAiiAWLo r XoNL-Pr X V 1 xjlNx (SEQ ID NO: 140; 
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TM*5 - 
J. i-io 


f A C \ 


VAT COT CI TT O T T 7\ T"T TT VRPH MT C T7T VT/^ r>TTT T*T» T XT T / CPA TT\ MA . t yi t \ 

x Air blxb Vr olxUAl Vlxx AisAWLibFIxNCPFI VTIiNI(bEQ ID NO:141) 




TM3 - 


(47) 


VATFITT.VACT CT7T7CT T MCMJT CCT MPDUT^rTT WT / crn Tn xrr> . t >i i \ 

x>ix r vxj x AoXjo v r oiiXiRlbAWLor ijINL-Pr X V X XjINX^oE^ iU 1NU : 14z J 




TM3 - 




VAT FT7T VAC AWT GT OT7"C*OT T TV T C? PT W^PfTTftM WT / T n vrr"\ . n x -5 \ 

x Ax r VXj X AoAWXjoXjo v r bXxioAX or xxJNt„Pr x V x XjJNX v olsy ID NO : 1 4 3 ) 




TM3 - 


(49 ) 


YAT FVT.VAC AWT. CFT. CT CT7T7CT T CTMPDrTTrFT MT / CPH TT^ M/**i . T A ^ 




TM3 - 


( 50 ) 


YAT FTTT.YAQ AWT CFT MT3FCT C\nrCT TAT TTTTT XJT / Cm TFt MA . i ACl 
x >-^-L r v Xj X i-ioJ-irt x-io r X-dN rr olio v r oxjXjAX XVI XjJN X \ ooy XD JNO:X4dJ 
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TM3 - 


( 51 ) 


YATFVT.YA'T , AWT.'rFT.Mr r \7 r FA r rTTJFT^7 T T , T MT ( CPA Tn MO - T A C \ 
X J-1X £■ VXiIAl/inxiJ. X* XjuLi V X >i 1 Xir r X V X XjIXX \ Ooy X XJ . In D / 




TM3 - 




Y AT FTTT.Y AT 1 AWT CFT WPTO C T\AT r PTAT'TTrT'? MT / CTTO Tn MO . 1 /t 7 \ 
i-Hx r VXiX AX AnXior Xx£MV_ XooXVVlAlXV 1 IxTiX \ bey XD WO : 14 / / 




TM3 - 


\ j j / 


YATFVT VAT'AWT CFT W\7TT MT HTTTTU / CFO Tn WO • T A Q ^ 
Xi-iX V VXj X AX AnX^or XjJN V X XxTi 1L1 1 11 V \ OAy 1JJ vi\J . X 4 o ) 




X 1U - 


i ji / 


V7A T ITT TT VATMJT TTT WTTtTTT KTT MfTTTTT KTT / O TT ^ Tn XTr\ . 1 A Q \ 

x Air Vxjx Al AWJL1 r LiTTxAI lJLjNI_il>lr I vILiNItbEy ID NO:14yj 




TM3 - 


|CC \ 

I J J / 


VAT TTT rT V7\T7\ WT CCT WX/TTV TTT WT T~) 17 TT TTH" WT / CPA T n W/^ . 1 C A \ 

x Air VltxAl Aw lib r XjNKLAi ILiNliPr I vILlNI \bfc>y ID KU: lbO) 


3 5 


TM3 - 


/ cr £T \ 

\ J O / 


VAT "CTTT V7V r PT\I»TT TPT MCA T T T T T\ OPTtTTT WT ( C?T7T> Tn XT/^\ . T C t \ 

x AX r VXjiAX AW ill r XjIN b L7 1 1 ■ 1 1 1 1A w 1 1 VI iolW 1 t bby ID NO ; 1 5 1 ) 




TM3 - 


(57) 


YAI FVLYaSAWXiTFLNI^xxXjIIjI^ (SEQ ID NO: 152) 




TM3 - 


(58) 


YAIFVLYASAWTjTFLiNTEI/TO fSEO TD NO -1^) 




TM3 - 


(59) 


YAIFVx^YASAWIjTFIiNICTELTVYTLTV^ (SEQ ID NO: 154} 




TM3 - 


( 60 ) 


YATFVTiYATAWT.ATFT.WYTT.TVTFT.WrPPTVTT.NT 1 Tn MO - 1 c; c \ 

X -T^-L L v J-J i j-v X J-\, r¥ Xoi-i X xl* XJ X VXXXjX V X r XjX* ^ Jr r X V X 1 1 \ u&y XLJ Vi \J . 1 j j ] 


4 0 


a I7XJ 


fC1 \ 

v o x ; 


lAlr VIiiAi)AWLAI hLb ViTLTVPF IVTLNI (bEQ ID NO: 156; 




TM3 - 


( £ " 










- T V Xj X -MtO M. W X>J\_3 X Kij W j. LjIh ^ i- r - v i .^I 1 * o v ' ^ ^ • - ~ 


4 5 


TM3 - 


(66) 


YAIFVLYAGGEIALWTLSFLNCPFIVTLNI (SEQ ID NO:16l) 
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TM3- 


(67) 


YAIFVLYATAWLSFFFLLFGYLGNFLLNCPFIVTLNI (SEQ ID N0:162) 




TM3- 


(68) 


YAIFVLYATAWLFFFLLFGYLGNFLLPFIVTLNI (SEQ ID NO: 163) 




TM3- 


(69) 


YAIFVLYATAWLTFLNTACFYVAITASLCFITEIALIPFIVTLNI (SEQ ID NO: 164) 




TM3- 


(70) 


YAIFVLYATAWLTACFYVAITATLCFITEIALICPFIVTLNI (SEQ ID NO: 165) 
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TM3- 


(71) 


YAI FVL Y ATAC F YVAI TATLC F I TE I AL I S F LNC P F I VTLN I ( S EQ ID NO: 166) 




TM3- 


(72) 


YAI TACF YVAI TAS LC F I TE I AL I ATAWLTFLNC P F I VTLNI (SEQ ID NO: 167) 




TM3- 


(73) 


YAI FVL Y ATAC F YVAI ITE IALITAWLTFLNCPFIVTLNI ( SEQ ID NO: 168) 




TM3 - 


(74) 


YAIFVLYASAWLTFLNACFYICLFAGVCFLIPFIVTLNI (SEQ ID NO:l69) 




TM3- 


(75) 


YAIFVLYASAWNACFYICLFAGVMFLILTFLNCPFIVTLNI (SEQ ID NO: 170) 
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TM3 - 


(76) 


YAIFVLYFYICLFAGVCFLIATAWLTFLNCPFIVTLNI (SEQ ID NO: 171) 




TM3- 


(77) 


YAI FVLYATVDAVNMFTTAWLTFIiNCPFI VTLNI (SEQ ID NO: 172) 




TM3- 


(78) 


YAI FTVDAVNMFTVLYATAWLTFLNCPFIVTLNI ( SEQ ID NO: 173) 




TM3- 


(79) 


YAI FVL YATAWLTVDAVNMFTS FLNC P F I VTLN I (SEQ ID NO: 174) 




TM3- 


(80) 


YAI FVLYATAWLS FLNTVDAVNMFTPF I VTLNI ( SEQ ID NO: 175) 
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TM3 - 


(81) 


YAIFVLYASAWLTFLNCPFIVSVDAVNMFTTLNI (SEQ ID NO: 176) 




TM3 - 


(82) 


YAIFVLYATAWLSVDMFTTFLNCPFIVTLNI (SEQ ID NO: 177) 




TM3- 


(83) 


YAISVDAVNMFTFVLYATAWLSFLNCPFIVTLNI (SEQ ID NO:178) 




TM3- 


(84) 


YAIFVLYASLTVFSLLAISAWLTFLNCPFIVTLNI (SEQ ID NO: 179) 




TM3- 


(85) 


YAIFVLYASAWLTLSVFTLLAISFLNCPFIVTLNI (SEQ ID NO: 180) 


20 


TM3- 


(86) 


YAIFVLYASAWLTFLSLSVFTLLAINCPFIVTLNI (SEQ ID NO: 181) 




TM3- 


(87) 


YAIFVLYASAWLTFLNPFSLSVFSLLAIIVTLNI (SEQ ID NO: 182) 




TM3- 


(88) 


YAIFVLYASAWLSFLNLGGVTASFTASVGPFIVTLNI (SEQ ID NO: 183) 




TM3- 


(89) 


YAI FVL YAS AWLS FLNLGGVTAS FTAS VGVTLNI ( SEQ ID NO : 1 84 ) 




TM3 - 


(90) 


YAIFVLLGGVTASFTASVNYASAWLSFLNCPFIVTLNI (SEQ ID NO: 185) 


25 


TM3 - 


(91) 


YAI FVL YAI FFFLLFS AWLS FLNCPFIVTLNI (SEQ ID NO: 186) 




TM3- 


(92) 


YAI FVLYASAWLSFLNCP FI VTLNI I FFFLLFI VTLNI (SEQ ID NO:187) 




TM3 - 


(93) 


YAI FVLYASAWIFFFLLFLS FLNCPFIVTLNI (SEQ ID NO: 188) 




TM3- 


(94) 


YAIFVLYASAWLFFTVLASELSVYTLTVSFLNCPFIVTLNI (SEQ ID NO: 189) 




TM3- 


(95) 


YAI FVL YAS AWLS FLFATLGGEIALCPFIVTLNI (SEQ ID NO: 190) 


30 


TM3- 


(96) 


YAI FVLYAFATLGGEIALSAWLS FLNCPFIVTLNI (SEQ ID NO: 191) 




TM3- 


(97) 


YAI FFTVLASELSVYTLTVYASAWLS FLNCPFIVTLNI (SEQ ID NO: 192) 




TM3- 


(98) 


YAI FFP I AALFAS I AS AWLS FLNCPFIVTLNI (SEQ ID NO:193) 




TM3- 


(99) 


YAIFVLYASAWLSFFPIAALFASIPFIVTLNI (SEQ ID NO:194) 




TM3- 


(100) 


YAI FVLYASAWLSFLNCP FFP IAALFAS I LN I (SEQ ID NO: 195) 
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TM3- 


(101) 


YAIFVLYASAWLSLDVLFSTASIMHLSFLNGGEIALWSLIVTLNI (SEQ ID NO: 196) 




TM3- 


(102) 


YAIFVLYASLDVLFSTASIMHLIALWSLNCPFIVTLNI (SEQ ID NO: 197) 




TM3- 


(103) 


YAIFVLYAGGEIALWSLSFLNSLDVLFSTASIMHLPFIVTLNI (SEQ ID NO: 198) 




TM3- 


(104) 


YAI FVL YAS AWLS FFDVLFSTAS IMHLFGYLGNFLLNCPFI VTLNI ( SEQ ID NO: 199) 




TM3 - 


(105) 


YAIFVLYASAWLFFFLLFGYLSLDVLFSTASIMHLGNFLLPFIVTLNI (SEQ ID NO: 200) 
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TM3- 


(106) 


YAIFVLYASAWLSFLNTACFYVAITASLSLMHLFITEIALIPFIVTLNI (SEQ ID NO: 201 




TM3 - 


(107) 


YASLDVLFSTAIMHLSAWLTACFYVAITASLCFITEIALICPFIVTLNI (SEQ ID NO: 202 




TM3- 


(108) 


YAI FVL Y ATAC F YVAI TAS L S F LNC P F I VTLN I S LD VL F S TAS I MHL (SEQ ID NO: 203) 




TM3- 


(109) 


YAITACFYVAITASLCF ITE I ALIAS AWLS FLNCPFIVTLNI (SEQ ID NO: 204) 




TM3- 


(110) 


YAIFVLYATACFYSTASILNLIMHLCAISLVAIITEIALISAWLSFLN (SEQ ID NO: 205) 


45 


TM3- 


(111) 


YAIFVLYASAWLSFLNACFYICLFASILNLIMHLGVCFLIPFIVTIoNI (SEQ ID NO: 206) 
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TM3 - 
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TM3 - 
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TM3- 
TM3- 
TM3 - 
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(126 
(127 
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(130 
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YAIFVLYASAWNASIl^IMHLCFYICLFAGVMLILSFI^CPFIVTLNI (SEQ ID NO:207) 
YAIFPFVQCWSIFSLVLIAWIiYFYIAGVCFLIASAWLSFLNCPFIVTI (SEQ ID NO: 208) 
P^^QCVSITVSIFSLVIjIAVYAIFVTiYASVD ( SEQ ID NO: 209) 

YAIFGDWSSVDAVNMFTVXYASAWLSFLNCPFIVTLNI (SEQ ID NO:210) 
YAIFVLYAGDWSSAVOjSVDAVNMFTSFI^CPFIVTLNI ( SEQ ID NO:211) 
YAIFVLYASAVnjGDWSSFIiNSVDAVNMFTPFIVTLNI (SEQ ID NO:212) 
YAIFVLYASAWLSFLNCPFIVGDWSSVDAVNMFTTLNI (SEQ ID NO: 213) 
YAIFVLYASAWIiGYIiGSVDMFTSFLNCPFIVTGDWSLNI (SEQ ID NO: 214) 
YAISVDAVNMFTFTLYAGYLGSAWLSFI^CPFIVTLNI (SEQ ID NO:215) 
YAIFSLSVFSLIJUiVLYASAWLGYIiGSFLNCPFIVTIiNI (SEQ ID N0:216) 
YAIFVLYAGYLGAGNMDSLSVFSLLAISAWLSFI^CPFIVTLNI (SEQ ID NO: 217) 
YAIFVLYASAWLSLSVTGNMSLLAISFLNCPFIVTTiNI (SEQ ID NO:218) 
YAIFVLYASAWLSFLSLSVFGGSLLAINCPFIVTLNI (SEQ ID NO:219) 
YAIFVLYASAWLSFLNPFSLSVFGSLLAIIVTLNI (SEQ ID NO: 220) 
YAIFVLYATAV^TFLSLANCVTATIPFIVTLNI (SEQ ID NO: 221) 
YAI FVL YATAVTLS FLNCTS LAS S I WTAT I VTLNI (SEQ ID NO: 222; 
YAIFVLYATAWLSFLNVTLNISLACTTTIV { SEQ ID NO: 223) 
YAIFVLYATAWLTFLNTATILSLANLMFIVTLNI (SEQ ID NO: 224) 
YAIFVLYATAWLSFLNMATILNL.PFSVDAVIVTLNI (SEQ ID NO: 225) 
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Recently discovered G-proteins also can be used according 
to the presently claimed invention to provide GPR polypeptides of the 
present invention, based on the teaching and guidance presented 
herein. Exampled of such GPR polypeptides of the present invention 
may include, as non- limiting examples, GPR polypeptides corresponding 
to transmembrane domain III, e.g., as follows: 
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TM3-(131) ISTMYTVTGRVTTI/^WCDFWLSSDITCCTASIIJ^CVIAL (SEQ ID NO:226) 

TM3-{132) ILYGYRWPLPSKLCAVWIYLDVLFSTASIMHLCAISL (SEQ ID NO:227) 

TM3-(133) IIYI V>IDRVnG^YFLCEVWLSVD^rrCCTCSILHLCVIAL (SEQ ID NO:228) 

TM3-(134) IADKTVRVAMGAENDLGYNFRSDDVCGHCWQWYCSL (SEQ ID NO:229) 

TM3-(135) I LNYWP FGLALCHFVNYSQAVS VLVSAYTLVAI S I (SEQ ID NO:230) 

TM3-(136) II^RTOFGIHLCKiVfLTCDVLCCTSSILJ^LCAIALD (SEQ ID NO:231) 

TM3-(137) IMASVMHRHCLPLIGICLSSERHCLVSIFVELGAL (SEQ ID NO:232) 



Further non- limiting examples of consensus GPR polypeptides 
for transmembrane domain III of several or many, such as 1-500, or 



3.ny rdn^6 or 



t- V> o "v o H ▼"t 



• protein 



TM3-:139- YAIFVLYATAVTLSFIJWCPFISILNI.CAIAI^VTLNI (SEQ ID NC : 2 3 4 • 
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TM3-U40) YAIFVLYATAVTLTFLNCPFISIFVELGALVTLNI (SEQ ID NO:235) 

TM3-U41) YAIFVLYASAWLTFLNCPFISIFVELSIMHLCAISLGALVTLNI (SEQ ID NO:236) 

TM3-(142) WAIFVLYAILGRWEFGIHLCKLWLTSAWLSIMHLCAISLSFLNCPFIVTLNI (SEQ ID NO:237) 

TM3-(143) WAIFVXYAILGRVreFGIHLCKXVTLTTAVn.SIMHLCA.ISLSFLNCPFIVTLNI (SEQ ID NO:238) 

5 TM3-(144) WAIFVLYATAWLTFLNCPFSIMHLCAISLIVTLNI (SEQ ID NO:239) 

TM3-(145) WAIFVLYASAWLTFLNCPFISIMHLCAISLVTLNI (SEQ ID NO:240) 

TM3-(146) YAVFVLYASAWLSFLNMSIMHLCAISLPFIVTLNI (SEQ ID NO:241) 

TM3-(147) YAVFVL YATAWLS F LNMP F S I LNLCAI ALD I VTLN I ( S E Q ID NO:242) 

TM3-(148) YAVFVLYATAWLSILNLG&IALDTFLNMPFIVTLNI (SEQ ID NO:243) 

10 TM3-(149) YAVFVLYASILNLCAIALDSAWLTFLNMPFIVTLNI (SEQ ID NO:244) 

TM3-(150) YAIFVLYASAWLSFLNCVTASIPFCLVSIFVELGALIVTLNI (SEQ ID NO:245) 

TM3-(151) YAI FVLYASAWLS FLNCLVS I FVELGALI WTAS I VTLN I (SEQ ID NO:246) 

TM3-(152) YAI FVLYASAWLS FLNVTLNCLVS I FVELGALI I (SEQ ID NO:247) 

TM3-(153) YAIFVLYASAVnjSFLNTASILNLMFICLVSIFVELGALVTLNI (SEQ ID NO:248) 

15 TM3-(154) YAIFVLYASAWLSFLNMASILNLPFCLVSIFVELGALVTLNI (SEQ ID NO:249) 

TM3- (155) YAIFVLYASAWLSFI^ILGRWEFGIHLCKLWLTCDVLCCTSSGILLLAPFIVTLNI (SEQ ID NO:250) 

TM3- (156) YAI FVLYASAWLS FLNMIL£RWEFGIHLCKLWLTCDVLCCTS SGI LL^ (SEQ ID NO; 251) 

TM3 - (157) YAIFVLYASAWLIUSRWEFGIHLCKLWLTCDVLCCTSSFI^SELSVYTLTVCPFIVTLNI (SEQ ID 
NO:252) 

20 TM3- (158) YAIFVLYAILGRWEFGIHLCKLWLTCDVLCCTSSAWLSFUmSELS\^ ID 
NO:253) 

TM3-(159) YAIFVLYASAWLASRWPLPLSVYTLTVSFLNCPFIVTLNI (SEQ ID NO:254) 

TM3 - (160) YAIFVLYASAWLASELILYYWRWPLPCLHDLVWLCTCSILiiLCVIALSVirTLTTO (SEQ ID 

NO:255) 

25 TM3 - (161) YAIFVLYASAWLSFLASELSVYASELSSTLHDLVWLWLDVFC^ (SEQ ID 

NO:256) 

TM3 - (162) YAIFVLYASAWLSFLNGGEIALWSLCPFIILYYWRWPLPCLHDLVSILHLCVIALVTLNI (SEQ ID 
NO:257) 

TM3-U63) YVWLWI^VFCCTCSIUILOTIALF^YASAWLSFI^GGEIALWSLIVTI^I (SEQ ID NO:258) 

30 TM3-(164) YAIFVLYASAWLAIILYYWRWPLPCLHDLGGEIALWSLNCPFIVTLNI (SEQ ID NO:259) 

Non- limiting examples of consensus GPR polypeptides for 
domain V across several or many, such as 1-500, or any range or value 
therein, G-protein receptors are as follows: 





TM5- 


(1) 


CDVFVFVDIMLCTASIFNLCAISVG (SEQ ID NO: 260) 


35 


TM5 - 


(2) 


YAIFVLYDIMLCTASIFNLCAISVG (SEQ ID NO: 2 61) 




TM5- 


(3) 


DYAIFVFVDIMLMTASIFNLMAISVG(SEQ ID NO: 262) 




TM5- 


(4) 


D YAI FVFVD I MLHTTAST I FNLMAT I TVG (SEQ ID NO: 2 63) 




TM5- 


(5) 


CDVAWYSSDIMLFYVCTASIFSSNLCAISSVG (SEQ ID NO: 2 64) 




TM5- 


(6) 


FLFCSLGSFYIPIAVILVDIMLCTASIFNLCAISVG (SEQ ID NO: 265) 


40 


TM5- 


(7) 


YAIFVLYDFLFCSLGSFYIPIAVILIMLCTASIFNLCAISVG (SEQ ID NO: 266) 




TM5- 


(8) 


DYAIFVFVDIMLMTASIFLFCSLGSFYIPIAVILISVG ( SEQ ID NO:267) 




TM5- 


(9) 


DYAIFVFVDI>ILHTTASTIFNLMAFLFCSLGSFYIPIAVILTITVG(SEQ ID NO: 268 
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TM5 - 


(10) 


CDVAVVYSSDIMLFYVCTASIFSSNLFLFCSLGSFYCAISSVG (SEQ ID NO: 2 69) 




TM5 - 


(11) 


CDVFVFVDIMLCTASIFNWYILSSIGSFFAPCLILLVYLLCAISVG ( SEQ ID NO: 2 70) 




TM5- 


(12) 


YAIFVLYDIMLCTASIFNLCTUVTCILSSIGSFFAPCLILLVYLSVG (SEQ ID NO: 271) 




TM5- 


(13) 


DYAIFVFVDIWILSSIGSFFAPCLIIJ.VYIASIFNIiMAISVGtSEQ ID NO: 272) 


5 


TM5- 


(14) 


DYAIWYILSSIGSFFAPCLIIJjVYLIMLHTTASTIFNIjMATITVG ( SEQ ID NO: 273) 




TM5 - 


(15) 


CDVAWYSSDIMLFYVCWYILSSIGSFFAPCLILLVYLSSNLCAISSVG(SEQ ID NO: 2 74) 




TM5- 


(16) 


CD VFVFVD I MLCTAS I F WYVT S S S I G S F FAP CL I NHLVYNLCAI SVG (SEQ ID NO: 275) 




TM5- 


(17) 


YAIFVLYDIMLCTASIFNLCAIWYVISSSIGSFFAPCLINHLVYSVG(SEQ ID NO: 276) 




TM5- 


(18) 


DYAIFVFVWYVISSSIGSFFAPCLINHLVYT>IMLMTASIFNLMAISVG (SEQ ID NO:277) 


10 


TM5 - 


(19 ) 


DYAIFVFVDIMIJnTASTIFWYVISSSIGSFFAPCLINHLVYTVG (SEQ ID NO: 278) 




TM5 - 


(20) 


CDVAVVYSSDIMLFYVCTASIFSVTYVISIGSFFAINHLVYNLC1AISSVG (SEQ ID NO:279) 




TM5- 


(21) 


CD VFVFVD I MLCTAS I FNLCAITYAI S S S VI S FY I P VAI LVTYT (SEQ Tu NO: 280) 




TM5 - 


(22) 


YAI FVLYDIMLCTATYAI S SSVI S F YI PVAI LVTYTS I FNLCAI SVG (SEQ ID NO:28l) 




TM5- 


(23) 


D YA I FVFVD I MLMTATY AI SSSVISFYIP VAI LVTYT I S VG (SEQ ID NO: 282) 


15 


TM5 - 


(24) 


TYAI S S S VI S FY I P VATD YAI FVFVD IMLHTTASTI FNLMATI TVG (SEQ ID NO:283) 




rrrwi- 

ii'U - 


\-*>-> i 


CD VAVVY S 3 D IMLF YVCTATYAI S S S VI S FY I PVAI LVT~ tT S S VG ( SEQ ID NO ; 2 8 4 ) 




TM5- 


(26) 


CDWVFVDFVIYSSWSFYLPFGVTVLVYACTASIF ID NO: 285) 




TM5- 


(27) 


YAI FVLYDFVIYSSVVSFYLPFGVTVLVYAS I FNLCAI SVG (SEQ ID NO:286) 




TM5- 


(28) 


DYAIFWVDFVTYSSWSFYLPFGVTVLVYATASIFNLMAISVG (SEQ ID NO: 287) 


20 


TM5 - 


(29) 


DYAIFVFVDFVIYSSWSFYLPFGVTVLVTAHTTASTIFNLMATITVG (SEQ ID NO:288) 




TM5- 


(30) 


CDVAVVYSSDFVIYSSWSFTLPFGVTVYVCTASIFSSNLCAISSVG(SEQ ID NO: 289) 




TM5- 


(31) 


CDVFVFVDIMLCTASYTIYSTCGAFYIPSVLLIILYGNLCAISVG (SEQ ID NO: 290) 




TM5- 


(32) 


YAIFVLYDIMLCTASYTIYSTCGAFYIPSVLLIILYGNLCAISVG (SEQ ID NO:29l) 




TM5 - 


(33) 


DYAI FVFVD IMLMTASYTIYSTCGAFYIPSVLLIILYGNLMAI SVG (SEQ ID NO: 292) 


25 


TM5 - 


(34) 


DYAIFVFVDIMLHTTASYTIYSTCGAFYIPSVLLIILYGMATITVG (SEQ ID NO: 293) 




TM5- 


(35) 


CD VAWYS SD IMS YT I YSTCGAF Y I P S VLLI I LYG I FS SNLCAI S SVG ( S EQ ID NO:294) 




TM5- 


(36) 


CDVFVFFVLI GS FVAVD IMLCTAS I FNLCAI SVG (SEQ ID NO: 295) 




TM5 - 


(37) 


YAI FVLYFVLIGSFVADIMLCTAS I FNLCAI SVG (SEQ ID NO: 29 6) 




TM5- 


(38) 


DYAIFVFVFVLIGSFVADIMLMTASIFNLMAISVG (SEQ ID NO: 297) 


30 


TM5- 


(39) 


DYAIFVFVFVLIGSFVADIMLHTTASTIFNLMATITVG (SEQ ID NO: 298) 




TM5- 


(40) 


CDVAVVYSS FVLIGSFVADIMLFYVCTAS I FS SNLCAI S SVG (SEQ ID NO: 299) 




TM5- 


(41) 


CDVFVFVDIMLCFFIPTLIMVITYFNLCAISVG (SEQ ID NO: 3 00) 




TM5- 


(42) 


YAIFVLYDIMLCFFIPTLIMVTTYFFNLCAISVG (SEQ ID NO: 301) 




TM5- 


(43) 


DYAIFVFVDIMLMFFIPTLIMVTTYFNLMAISVG (SEQ ID NO: 302) 


35 


TM5 - 


(44 ) 


DYAIFVFVDIMLHTFFIPTLIMVTTYFNLMATITVG (SEQ ID NO: 303) 




TM5- 


(45) 


CDVAWYS SDIMLFYVCFF I PTLIMVITYFS SNLCAI S SVG (SEQ ID NO: 3 04) 




TM5 - 


(46) 


CDVVYGLVDGLVTFYLPLLIMCITYYDIMLCTASIFNLCAISVG(SEQ ID NO: 305) 




TM5 - 


(47) 


YAIVYGLVDGLVTFYLPLLIMCITYYD IMLCTAS I FNLCAI SVG ( SEQ ID NO: 306) 




TM5 - 


(48) 


DYAIVYGLVDGLVTFYLPLLIMCITYYDIMLMTASIFNLMAISVG ( SEQ ID NO: 307) 


a n 
-± w 


TM5 - 


(43) 


D YA I VYG LVDGL VT F Y L P LL I MC I S S D I MLHTTAS T I FNLMAT I TVG (SEQ ID NO: 308) 




TM5 - 




DYAIFVF\TDIMLMLVIFIX3LVIVIPFVLIIVSYAIFNLMAISVG ( SEQ 1 ZD NO : 3 1 2 .■ 


4 5 


TM5 - 


(54 ) 


DYAI FVFVD I MLHTLVI FLGLVIVI PFVLI IVS YAI FNLMATI TVG ( SEQ ID NO .313) 
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- 32 - 

CDVAVVYSSDIMLFLVIFU3LVIVIPFVLIIVSYAIFSSNLCAISSVG(SrJQ ID NO: 3 14) 
CDVT^VDI>Q J CTALMiyiI^LIIIIPFLLIVMSYVSIFNLCAISVG(SEQ ID NO: 315) 
YAI FVLYDIMLCTALMIYILGGLI I I IPFLLIVMSYVS IFNLCAI SVG ( SEQ ID NO: 316) 
DYAIFVFVDIMLmT^IFNmiYILGGLIIIIPFLLI ID NO; 317) 

DYAIFVFVDIMIJnTASTILMIYII^LIIIIPFLLIVMSYVITVG{SE^ ID NO: 318) 
CDVAVVYSSDIMLFWCTAYIUK3LIPFLLIVMTYVSIFTNLCAISSVG(SEQ ID NO: 3 19) 
CDVFVFVDIMLCTASIFNI^IM ID NO: 320) 

YAI FVLYDIMLCTAS I FNLLMIHIMEVT 1 1 VTPFVLI VISYACAI SVG ( SEQ ID NO: 321) 
DYAIFVFVDIMLMTASIFLMIHIMEVIIIVIPFVLIVISYAISVG(SEQ ID NO:322) 
DYAIFVFVDIMLHTTASTILMIHIMEVIIIVIPFVLIVISYAITVG (SEQ ID NO: 323) 
CDVAWYSSDIMLFYVCTASIFLMIHIMEVIIIVIPFVLIVISYAAISSVG{SEQ ID NO: 324 ; 



Non- limiting examples of longer consensus GPR polypeptides 
for domain V across several or many, such as 1-500, or any value or 
range therein, G-protein receptors are as follows: 



15 t m i . ( , j 

TMlNWPMiSIWIIINTITCNILVIMATO^ 
IGYVCSSSLGINPVIIYTLF (SEQ ID NO: 325) 

T H 1 - ( 2 , 

NWPALSIVVIIINTIGGNILVIMAVTIYTTLDV 
20 VCTTTLGINPVTIYTLF (SEQ ID NO: 32 6} 

T M 1 . ( 3 j 

NWPALTI WI I INTIGGNI LVIMAVS I YTTLDVMLCTATI LNLLITLFVLIGTFVAFFIPLTIMVITYFLFNVFFVWIGY 
VCSTSLGINPVIIYTLF (SEQ ID NO: 327) 

T M 1 . ( 5 } 

2 5 NWPALTI WI I INTIGGNILVIMAVTIYTTLDVMLCTATILNLLITLF^IGTFV 
VCTLGINPVIIYTLF (SEQ ID NO: 32 8} 

T M 1 - ( 6 ) 

NWKNWSALLTTWIILTIAGNILVIMAVSSI^ 
VCSSSLGINPVIIYTLF(SEQ ID NO: 329) 
30 T M 1 . ( 7 } 

ITITVVIAVLILITVAGNVWCIAVGSIYTSLDVMLCTASII^ 
YVCSSSLGINPVIIYTLF (SEQ ID NO: 330) 

T M 1 - ( 8 , 

TLTLVCIACLUSLTVFGNVLV^ ^ 
35 SSLGINPVIIYTLF (SEQ ID NO: 331) 

T M 1 - { 9 ] 

TAAI AAAITFLILFTI FGNALVI IAVLS I YTSLDVMLCTAS I LNLLI SLFVLIGSFVAFFIPLTIMVITYFLFNVFFVWI 
GYVCS SSLGINPVIIYTLF (SEQ ID NO: 332) 

T M 1 - ( 1 0 ) 

40 AISVGLVLGAFILFAIVGNILVILSVANWPALSIW^ 

FVAFFIPLTIMVITYFLFNVFFVWI GYVCS SSLGINPVIIYTLF (SEQ ID NO: 333) 
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T M 1 (11) 

AALAGALIJVI^VIJVTVGGNLLVIVAIA^ 

SSSLGINPVIIYTLF (SEQ ID NO: 334) 

T M 1 - ( i 2 ) 

5 TAGDCLIMLIVLLIVAGNVLVIVAISIJDV 
SLGINPVIIYTLF (SEQ ID NO: 335) 

T M 1 - (13) 

VITIAVVTAVVSLMTIVGNVTjVMISFSIYTSIJTVMLCT 
YVCSS SLGINPVIIYTLF (SEQ ID NO: 336) 
10 T M 1 - ( i 4 } 

MVTIATVRGSLSLVTWGNILVl^SISIYTSLDVMLCTASILNLLIS 
YVCSSSLGINPVIIYTLF (SEQ ID NO:337) 

T M 1 - ( i 5 } 

WFIAFLTGILALVTIIGNILVIVSFSIYTSLDVMLCTASILNLLISLFVLIGSFVJ^ 
15 VCSS SLGINPVIIYTLF (SEQ ID NO: 338) 



Non- limiting examples of longer consensus GPR polypeptides 
for domain V across several or many, such as 1-500, or any value or 
range therein, G-protein receptors are as follows: 

T M 3 - (16 5) 

2 0 *^PALSIVVII INTIGGNILVIMAPFACFVLVLTQSSIFSLLA^ 

VWIGYVCSS SLGINPVIIYTLF (SEQ ID NO: 339) 

T M 3 - ( 1 6 6 ) 

NWPALS I WI I INTIGGNILVIMAFFACFVLVLTQSS I FS 
CSSSLGINPVIIYTLF (SEQ ID NO: 34 0) 
25 T M 3 - ( 1 6 7 ) 

NWPALS I WI I INTIGGNI LVIMAVMVACPVLI LTQS S 1 1 ALLAIAVS FVAFFI PLTIMVITYFLFNVFFVWIGYVCSSS 
LGINPVIIYTLF (SEQ ID NO: 341) 

T M 3 - (16 8) 

NWPALSIWIIINTIGGNILVIMAVLWLAIJDYVASNASV^ 

3 0 PVIIYTLF(SEQ ID NO: 342) 

T M 3 - { l 6 9 ) 

NWPALSIWIIINTIGGNILVIMAVLYWSNASVMNLLIISSFVA^ 

IIYTLF (SEQ ID NO: 34 3) 

T M 3 - ( l 7 0 ) 

3 5 NWPALSIWIIINTIGGNILVIMAVLWIAIDYVASNASVLNL^ 
SLGINPVIIYTLF (SEQ ID NO: 344) 

T M 3 - ' I 7 1 



NWPALS I'"/ 1 I INTIGGNILVIMAVCITYLQYLGINASSCS ITAFTI I G S FVAF F I ? LT * MV I TY F L FNVF FVW I G YVC S 
SSLGINPVIIYTLF (SEQ ID NO: 346) 
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T M 3 (17 3) 

NWPALSIWIIIOTIGGNILVIMAWHNFFPIAA^ 

LGINPVIIYTLF (SEQ ID NO: 34 7) 

T M 3 (17 4) 

NWPALSIWIIINTIGGNILVIMAVIASASVSFNLYASVFLLTCLSIGS 

S LGINPVIIYTLF (SEQ ID NO:348) 

As another non- limiting , illustrative example of a GPR 
polypeptide consensus sequences across each individual or different 
transmembrane domains of 5-HT receptors may be made, such as for 5- 
HT, as the following: 

5HT consensus(4) KNASALLS VI I INS IGGNWTAVS (SEQ ID NO:349); 
5HT consensus (5) YFLMSLAVTDLWSFVMPVSAL (SEQ ID NO:350); 
5HT consensus (6) AITKIAITWAISGVSVPFIPVWG (SEQ ID NO:351); and 
5HT consensus(7) LGIIFGTFIIIWLPFFITNLVSPI (SEQ ID NO:352); 

Wherein variations and substitutions of amino acids may be made as 
described herein. 

Alternatively, 5-HT consensus sequences may be 
provided as consensus peptides of the present invention as consensus 
peptides for individual transmembrane domains, such as 5-HT domains 
III, V and VII, e.g., as follows: 

5-HT consensus (8): IWISLDVLFSTASSIMHLCAISL (SEQ ID NO:353) 
5-HT consensus (9) : GYTIYSTLVTFYIPSVTMVITYG {SEQ ID NO: 354 J 
5-HT consensus (10) : LLNFFNWIGYLNSLINPVIYTLF (SEQ ID NO: 355) 

This invention is also directed to an antibody which binds 
an epitope specific for a GPR polypeptide of the present invention 
and the use of such an antibody to detect the presence of, or measure 
the quantity or concentration of, the GPR protein in a cell, a cell 
or tissue extract, a biological fluid, an extract thereof, a 
solution, or sample, in vitro, in situ, or in vivo. 
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The term "antibody" is meant to include polyclonal 
antibodies, monoclonal antibodies (mAbs), chimeric antibodies, anti- 
idiotypic (anti-Id) antibodies to antibodies specific for GPR 
polypeptide of the present invention, as well as fragments, consensus 
5 polypeptides or chemical derivatives thereof. 

Polyclonal antibodies are heterogeneous populations of 
antibody molecules derived from the sera of animals immunized with 
an antigen. 

A monoclonal antibody contains a substantially homogeneous 
10 population of antibodies specific to antigens, which population 
contains substantially similar epitope binding sites. MAbs may be 
obtained by methods known to those skilled in the art. See, for 
example Kohler and Milstein, Nature 256:495-497 (1975); U.S. Patent 
No. 4,376,110; Ausubel et al , eds . , Current Protocols in Molecular 
15 Biology, Wiley Interscience , N.Y. , (1987, 1992); and Harlow and Lane 
Antibodies; A Laboratory Manual Cold Spring Harbor Laboratory (19 88) , 
the contents of which references are incoporated entirely herein by 
reference. Such antibodies may be of any immunoglobulin class 
including IgG, IgM, IgE, IgA, GILD and any subclass thereof. A 

2 0 hybridoma producing a mAb of the present invention may be cultivated 

in vi tro, in si tu or in vivo . Production of high titers of mAbs in 
vivo or in situ makes this the presently preferred method of 
production . 

Chimeric antibodies are molecules different portions of 
25 which are derived from different animal species, such as those having 
variable region derived from a murine mAb and a human immunoglobulin 
constant region, which are primarily used to reduce immunogenicity 
in application and to increase yields in production , for example , 
where murine mAbs have higher yields from hybridomas but higher 

3 0 immunogenicity in humans, such that human/murine chimeric mAbs are 

used. Chimeric antibodies and methods for their production are known 
in the art (Cabilly et al , Proc . Natl. Acad. Sci . USA 81:3273-3277 
(1984); Morrison et al . , Proc. Natl. Acad. Sci. USA 81:6851-6855 
( 19 84 ) ; Boulianne et al . , Nature 312 : f,4 - - - 4^ 'iQP4 v - ; ~Bbi — v ^ ^~ 

European Patent Application 17149^ v published February lb, 1985; ; 
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Morrison et al . , European Patent Application 173494 (published March 
5, 1986); Neuberger et al . , PCT Application WO 86/01533, (published 
March 13, 1986); Kudo et al . , European Patent Application 184187 
(published June ±1, 1986); Morrison et al . , European Patent 
5 Application 173494 (published March 5, 1986); Sahagan et al . , J. 
Immunol. 137:1066-1074 (1986); Robinson et al . , International Patent 
Publication No .PCT/US86/02269 (published 7 May 1987); Liu et al . , 
Proc. Natl. Acad. Sci . USA 84:3439-3443 (1987); Sun et al . , Proc. 
Natl. Acad. Sci. USA 84:214-218 (1987); Better et al., Science 

10 240:1041- 1043 (1988); and Harlow and Lane AntiJbodies; A Laboratory 
Manual Cold Spring Harbor Laboratory (1988)). These references are 
incorporated entirely herein by reference. 

An anti- idiotypic (anti-Id) antibody is an antibody which 
recognizes unique determinants generally associated with the antigen- 

15 binding site of an antibody. An Id antibody can be prepared by 
immunizing an animal of the same species and genetic type (e.g., 
mouse strain) as the source of the mAb with the mAb to which an anti- 
Id is being prepared. The immunized animal will recognize and 
respond to the idiotypic determinants of the immunizing antibody by 

2 0 producing an antibody to these idiotypic determinants (the anti -Id 
antibody). See, for example, U.S. patent No. 4,699,880, which is 
herein entirely incorporated by reference. 

The anti -Id antibody may also be used as an "immunogen" to 
induce an immune response in yet another animal, producing a so- 

25 called anti-anti-Id antibody. The anti-anti-Id may be epitopically 
identical to the original mAb which induced the anti -Id. Thus, by 
using antibodies to the idiotypic determinants of a mAb, it is 
possible to identify other clones expressing antibodies of identical 
specificity . 

30 Accordingly, mAbs generated against a GPR polypeptide of 

the present invention may be used to induce anti- Id antibodies in 
suitable animals, such as BALB/c mice. Spleen cells from such 
immunized mice are used to produce anti -Id hybridomas secreting anti- 
Id mAbs. Further, the anti -Id mAbs can be coupled to a immunogenic 

35 carrier such as keyhole limpet hemocyanin (KLH) or cationized bovine 
serum albumin and used to immunize additional BALB/c mice. Sera from 
these mice will contain anti-anti-Id antibodies that have the binding 
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properties of the original mAb specific for a GPR polypeptide 
epitope . 

The anti-Id mAbs thus have their own idiotypic epitopes, or 
"idiotopes" structurally similar to the epitope being evaluated. 
5 The term "antibody" is also meant to include both intact 

molecules as well as fragments thereof, such as, for example, Fab and 
F(ab') 2 , which are capable of binding antigen. Fab and F(ab') 2 
fragments lack the Fc fragment of intact antibody, clear more rapidly 
from the circulation, and may have less non-specific tissue binding 
10 than an intact antibody (Wahl et al . , J . Nucl . Med. 24 : 316 - 325 
(1983) ) . 

It will be appreciated that Fab and F ( ab 1 ) 2 and other 
fragments nf the antibodies useful in the present invention may be 
used for the detection and quantitation of a GPR polypeptide 

15 according to the methods disclosed herein for intact antibody 
molecules. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or 
pepsin (to produce F(ab ! ) 2 fragments) . 

An antibody is said to be "capable of binding" a molecule 

20 if it is capable of specifically reacting with the molecule to 
thereby bind the molecule to the antibody. The term "epitope" is 
meant to refer to that portion of any molecule capable of being bound 
by an antibody which can also be recognized by that antibody. 
Epitopes or "antigenic determinants " usually consist of chemically 

25 active surface groupings of molecules such as amino acids, lipids or 
sugar side chains and have specific three dimensional structural 
characteristics as well as specific charge characteristics . 

An "antigen" is a molecule or a portion of a molecule 
capable of being bound by an antibody which is additionally capable 

3 0 of inducing an animal to produce antibody capable of binding to an 
epitope of that antigen. An antigen may have one, or more than one 
epitope. The specific reaction referred to above is meant to 
indicate that the antigen will react, in a highly selective manner, 
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detect a GPR polypeptide in a sample or to detect presence of cells 
which express a GPR polypeptide of the present invention. This can 
be accomplished by immunofluorescence techniques employing a 
f luorescently labeled antibody {see below) coupled with light 
5 microscopic, flow cytometric, or fluorometric detection. 

The antibodies (of fragments thereof) useful in the present 
invention may be employed histologically, as in immunofluorescence 
or immunoelectron microscopy, for in situ detection of a GPR 
polypeptide of the present invention. In situ detection may be 

10 accomplished by removing a histological specimen from a patient, and 
providing the a labeled antibody of the present invention to such a 
specimen. The antibody (or fragment) is preferably provided by 
applying or by overlaying the labeled antibody (or fragment) to a 
biological sample. Through the use of such a procedure, it is 

15 possible to determine not only the presence of a GPR polypeptide but 
also its distribution on the examined tissue. Using the present 
invention, those of ordinary skill will readily perceive that any of 
wide variety of histological methods (such as staining procedures) 
can be modified in order to achieve such in situ detection. 

20 Such assays for a GPR polypeptide of the present invention 

typically comprise incubating a biological sample, such as a 
biological fluid, a tissue extract, freshly harvested cells such as 
lymphocytes or leukocytes, or cells which have been incubated in 
tissue culture, in the presence of a detectably labeled antibody 

25 capable of identifying a GPR polypeptide, and detecting the antibody 
by any of a number of techniques well-known in the art. See, e.g., 
Harlow and Lane, supra ; Ausubel et al , supra ; and Sambrook et al , 
supra . 

The biological sample may be treated with a solid phase 
30 support or carrier, such as nitrocellulose, or other solid support 
or carrier which is capable of immobilizing cells, cell particles or 
soluble proteins. The support or carrier may then be washed with 
suitable buffers, followed by treatment with a detectably labeled GPR 
polypeptide- specif ic antibody. The solid phase support or carrier 
35 may then be washed with the buffer a second time to remove unbound 
antibody. The amount of bound label on said solid support or carrier 
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may then be detected by known method steps, see, e.g. , Harlow, supra ; 
Ausubel, supra ; or Sambrook, supra . 

By "solid phase support", "solid phase carrier", "solid 
support", "solid carrier", "support" or "carrier" is intended any 
5 support or carrier capable of binding antigen or antibodies. Well- 
known supports or carriers , include glass , polystyrene , 
polypropylene, polyethylene, dextran, nylon amylases , natural and 
modified celluloses, polyacrylamides , gabbros , and magnetite. The 
nature of the carrier can be either soluble to some extent or 

10 insoluble for the purposes of the present invention. The support 
material may have virtually any possible structural configuration so 
long as the coupled molecule is capable of binding to an antigen or 
antibody. Thus, the support or carrrier configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of 

15 a test tube, or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet , polymer test strip, etc . 
Preferred supports or carriers include polystyrene beads. Those 
skilled in the art will know many other suitable carriers for binding 
antibody or antigen, or will be able to ascertain the same by use of 

2 0 routine experimentation . 

The binding activity of a given lot of anti-GPR polypeptide 
antibody may be determined according to well known method steps. 
Those skilled in the art will be able to determine operative and 
optimal assay conditions for each determination by employing routine 

25 experimentation. See, e.g. , Harlow, supra . 

Other such steps as washing, stirring, shaking, filtering 
and the like may be added to the assays as is customary or necessary 
for the particular situation . 

One of the ways in which a GPR polypeptide - specif ic 

3 0 antibody , ant i - idio type antibody or fragment thereof , can be 
detectably labeled is by linking the same to an enzyme and use in an 
enzyme immunoassay { EIA) . This enzyme, in turn, when later exposed 
to an appropriate substrate , will react with the substrate in such 



are not i imi ted :c , malate dehydrogenase , staphylococcal nuclease , 
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delta- 5 - steroid isomerase, yeast alcohol dehydrogenase, alpha - 
glycerophosphate dehydrogenase, triose phosphate isomerase, 
horseradish peroxidase; alkaline phosphatase, asparaginase, glucose 
oxidase, beta-galactosidase, ribonuclease , urease, catalase, glucose- 
5 6- phosphate dehydrogenase, glucoamylase and acetylcholinesterase. 
The detection can be accomplished by colorimetric methods which 
employ a chromogenic substrate for the enzyme. Detection may also 
be accomplished by visual comparison of the extent of enzymatic 
reaction of a substrate in comparison with similarly prepared 

10 standards. See, Harlow, supra , Ausubel, supra . 

Detection may be accomplished using any of a variety of 
other immunoassays. For example, by radioactivity labeling the 
antibodies or antibody fragments, it is possible to detect R-PTPase 
through the use of a radioimmunoassay (RIA) . A good description of 

15 RIA maybe found in Laboratory Techniques and Biochemistry in 
Molecular Biology, by Work et al . , North Holland Publishing Company, 
NY (1978) with particular reference to the chapter entitled "An 
Introduction to Radioimmune Assay and Related Techniques" by Chard, 
incorporated entirely by reference herein. The radioactive isotope 

20 can be detected by such means as the use of a 7-counter, a 
scintillation counter or by autoradiography. 

It is also possible to label an anti-GPR polypeptide 
antibody, anti - idiotype antibody or fragment thereof, with a 
fluorescent compound. When the f luorescently laoeled antibody is 

25 exposed to light of the proper wave length, its presence can be then 
be detected due to fluorescence. Among the most commonly used 
fluorescent labelling compounds are fluorescein isothiocyanate , 
rhodamine , phycoerythrin , phycocyanin, allophycocyanin , 
o-phthaldehyde and fluorescamine, commercially avai lable, e.g., from 

3 0 Molecular Probes, Inc. (Eugene, Ore.). 

The antibody can also be detectably labeled using 
fluorescence emitting metals such as 152 EU, or others of the 
lanthanide series. These metals can be attached to the antibody 
using such metal chelating groups as diethylenetriamine pentaacetic 

3 5 acid (EDTA) . 

The antibody also can be detectably labeled by coupling it 
to a chemiluminescent compound. The presence of the 
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chemiluminescent- tagged antibody is then determined by detecting the 
presence of luminescence that arises during the course of a chemical 
reaction. Examples of particularly useful chemiluminescent labeling 
compounds are luminol , isoluminol , theromatic acridinium ester, 
5 imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label 
the antibody of the present invention. Bioluminescence is a type of 
chemiluminescence found in biological systems in which a catalytic 
protein increases the efficiency of the chemiluminescent reaction. 

10 The presence of a bioluminescent protein is determined by detecting 
the presence of luminescence. Important bioluminescent compounds 
for purposes of labeling are luciferin, lucif erase and aequorin. 

An antibody molecule of the present invention may be 
adapted for utilization in a immunometric assay, also known as a 

15 "two-site" or "sandwich" assay. In a typical immunometric assay, a 
quantity of unlabeled antibody (or fragment of antibody) is bound to 
a solid support or carrier and a quantity of detectably labeled 
soluble antibody is added to permit detection and/or quantitation of 
the ternary complex formed between solid-phase antibody, antigen, and 

2 0 labeled antibody. 

Typical , and preferred, immunometric assays include 
"forward" assays in which the antibody bound to the solid phase is 
first contacted with the sample being tested to extract the antigen 
form the sample by formation of a binary solid phase antibody- antigen 

25 complex. After a suitable incubation period, the solid support or 
carrier is washed to remove the residue of the fluid sample, 
including unreacted antigen, if any, and then contacted with the 
solution containing an unknown quantity of labeled antibody (which 
functions as a "reporter molecule"). After a second incubation 

30 period to permit the labeled antibody to complex with the antigen 
bound to the solid support or carrier through the unlabeled antibody, 
the solid support or carrier is washed a second time to remove the 
unreacted labeled antibody. 



reverse" assays are used. A simultaneous assay involves a smqle 
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incubation step as the antibody bound to the solid support or carrier 
and labeled antibody are both added to the sample being tested at the 
same time. After the incubation is completed, the solid support or 
carrier is washed to remove the residue of fluid sample and 
5 uncomplexed labeled antibody. The presence of labeled antibody 
associated with the solid support or carrier is then determined as 
it would be in a conventional "forward" sandwich assay. 

In the "reverse" assay, stepwise addition first of a 
solution of labeled antibody to the fluid sample followed by the 

10 addition of unlabeled antibody bound to a solid support or carrier 
after a suitable incubation period is utilized. After a second 
incubation, the solid phase is washed in conventional fashion to free 
it of the residue of the sample being tested and the solution of 
unreacted labeled antibody. The determination of labeled antibody 

15 associated with a solid support or carrier is then determined as in 
the "simultaneous" and "forward" assays. See, e.g., for the above- 
mentioned immunological techniques, Harlow, supra ; Ausubel et al, 
suera; and Sambrook et al, supra . GPR polypeptides of the present 
invention can be made by chemical synthesis or by recombinant 

20 methods, wherein chemical synthesis is preferred. 

Synthetic product ion of transmembrane proteins of the 

present invention 

GPR polypeptides, variants and chemical derivatives thereof 

can be synthesized according to known method steps, including 
25 portions of known GPR transmembrane domains, consensus peptides 

thereof, conservative substitution derivative thereof or functional 

derivatives thereof. 

Chemical polypeptide synthesis is a rapidly evolving area 

in the art, and methods of solid phase polypeptide synthesis are 
3 0 well -described in the following references, hereby entirely 

incorporated by reference: (Merrifield, B., j\ Amer. Chem. Soc. 

85:2149-2154 (1963); Merrifield, B., Science 232:341-347 (1986); 

Wade, J.D. et al . , Biopolymers 25 : S21 - S37 (1986); Fields, G.B., Int. 

J. Polypeptide Prot. Res, 35:161 (1990); MilliGen Report Nos . 2 and 
35 2a, Millipore Corporation, Bedford, MA, 1987) Ausuoel et al , supra . 

and Sambrook et al . supra . 
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In general, as is known in the art, such methods involve 
blocking or protecting reactive functional groups, such as free 
amino, carboxyl and thio groups. After polypeptide bond formation, 
the protective groups are removed (or de-protected). Thus, the 
addition of each amino acid residue requires several reaction steps 
for protecting and deprotecting . Current methods utilize solid phase 
synthesis, wherein the C- terminal amino acid is covalently linked to 
an insoluble resin particle large enough to be separated from the 
fluid phase by filtration. Thus, reactants are removed by washing 
the resin particles with appropriate solvents uding an automated 
programmed machine The completed polypeptide chain is cleaved from 
the resin by a reaction which does not affect polypeptide bonds. 

In the more classical method, known as the "tBoc method. " 
the amino group of the amino acid being added to the resin-bound 
C- terminal amino acid is blocked with tert-butyloxvcarbonyl chloride 
(tBoc) . This protected amino acid is reacted with the bound amino 
acid in the presence of the condensing agent 
dicyclohexylcarbodiimide, allowing its carboxyl group to form a 
polypeptide bond the free amino group of the bound amino acid. The 
amino -blocking group is then removed by acidification with 
trifluoroacetic acid (TFA) ; it subsequently decomposes into gaseous 
carbon dioxide and isobutylene. These steps are repeated cyclically 
for each additional amino acid residue. A more vigorous treatment 
with hydrogen fluoride (HF) or trif luoromethanesulf onyl derivatives 
is common at the end of the synthesis to cleave the benzyl - derived 
side chain protecting groups and the polypeptide- resin bond. 

More recently, the preferred "Fmoc" technique has been 
introduced as an alternative synthetic approach, offering milder 
reaction conditions, simpler activation procedures and compatibility 
with continuous flow techniques. This method w.:is used, e.g., to 
prepare the peptide sequences disclosed in the present application. 
Here, the «-amino group is protected by the base labile 
9-fluorenylmethoxycarbonyl (Fmoc) group. The benzyl side chain 
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solution is used instead to cleave side chain protecting groups and 
the polypeptide resin linkage simultaneously. 

At least three different polypeptide- resin linkage agents 
can be used: substituted benzyl alcohol derivatives that can be 
5 cleaved with 95% TFA to produce a polypeptide acid, methanolic 
ammonia to produce a polypeptide amide, or 1% TFA to produce a 
protected polypeptide which can then be used in fragment condensation 
procedures, as described by Atherton, E. et al . , J. Chem. Soc. Perkin 
Trans. 1:538-546 (1981) and Sheppard, R.C. et al . , Int. J. 

10 Polypeptide Prot. Res. 20:451-454 (1982) . Furthermore, highly 
reactive Fmoc amino acids are available as pentaf luorophenyl esters 
or dihydro-oxobenzotriazine esters derivatives, saving the step of 
activation used in the tBoc method. 

Sequences available to use as a basis for polypeptide 

15 synthesis can be based on published sequences of G-protein coupled 
receptors, ligands and/or effectors, wherein the transmembrane or 
functional domains correspond to sections of hydrophobic or other 
amino acids of 5 to 100 amino acids, such as 5-10, 10-15, 15-25, 
20-25, 23-27, 25-30, 28-35, 20-40, 10-40, 20-30, 30-40, 40-50, 10-80, 

20 20-60 or 25-40 amino acids in length. Recombinant production of GPR 
polypeptides can be accomplished according to known method steps. 
Standard reference works setting forth the general principles of 
recombinant DNA technology include Watson, J.D. et al . , Molecular 
Biology of the Gene, Volumes I and II, The Benjamin/Cummings 

25 Publishing Company, Inc., publisher, Menlo Park, CA (1987); Darnell, 
J.E. et al . , Molecular Cell Biology, Scientific American Books, Inc., 
publisher, New York, NY (1986); Lewin, B.M., Genes III, John Wiley 
& Sons, publishers, New York, NY (1989); Old, R.W., et al . , 
Principles of Gene Manipulation: An Introduction to Genetic 

30 Engineering, 2d edition, University of California Press, publisher, 
Berkeley, CA (1981); Ausubel et al , eds., Current Protocols in 
Molecular Biology, Wiley Interscience, publisher, New York, NY (1987, 
1992) ; and Sambrook et al . , Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor Laboratory, publisher, Cold 

35 Spring Harbor, NY (1989) , the entire contents of which references are 
herein incorporated by reference. 
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A nucleic acid sequence encoding a GPR polypeptide of the 
present invention may be recombined with vector DNA in accordance 
with conventional techniques, including blunt -ended or 
staggered- ended termini for ligation, restriction enzyme digestion 
5 to provide appropriate termini , filling in of cohesive ends as 
appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and ligation with appropriate ligases. Techniques for such 
manipulations are disclosed, e.g., by Ausubel et al , supra, and are 
well known in the art . 

10 A nucleic acid molecule, such as DNA, is said to be 

"capable of expressing" a polypeptide if it contains nucleotide 
sequences which contain transcriptional and translational regulatory 
information and such sequences are "operably linked" to nucleotide 
sequences which encode the polypeptide. An operable linkage is a 

15 linkage in which the regulatory DNA sequences and the DNA sequence 
sought to be expressed are connected in such a way as to permit gene 
expression as GPR polypeptides in recoverable amounts. The precise 
nature of the regulatory regions needed for gene expression may vary 
from organism to organism, as is well known in the analogous art. 

20 See, e.g., Sambrook, supra and Ausubel supra. 

The present invention accordingly encompasses the 
expression of a GPR polypeptide, in either prokaryotic or eukaryotic 
cells, although eukaryotic expression is preferred. 

Preferred hosts are bacterial or eukaryotic hosts including 

25 bacteria, yeast, insects, fungi, bird and mammalian cells either in 
vivo, or in situ, or host cells of mammalian, insect, bird or yeast 
origin. It is preferred that the mammalian ceU or tissue is of 
human, primate, hamster, rabbit, rodent, cow, pig, sheep, horse, 
goat, dog or cat origin, but any other mammalian cell may be used. 

3 0 Further, by use of, for example, the yeast ubiquitin 

hydrolase system, in vivo synthesis of ubiquitin- transmembrane 
polypeptide fusion proteins may be accomplished. The fusion proteins 
so produced may be processed in vivo or purified and processed in 
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avoided. Sabin et al . , Bio/Technol . 7(7): 705-709 (1989); Miller et 
al., Bio/Technol. 7(7): 698-704 (1989). 

Any of a series of yeast gene expression systems 
incorporating promoter and termination elements from the actively 
5 expressed genes coding for glycolytic enzymes produced in large 
quantities when yeast are grown in mediums rich in glucose can be 
utilized to obtain GPR polypeptides of the present invention. Known 
glycolytic genes can also provide very efficient transcriptional 
control signals. For example, the promoter and terminator signals 

10 of the phosphoglycerate kinase gene can be utilized. 

Production of GPR polypeptides or functional derivatives 
thereof in insects can be achieved, for example, by infecting the 
insect host with a baculovirus engineered to express transmembrane 
polypeptide by methods known to those of skill. See Ausubel et al , 

15 eds. Current Protocols in Molecular Biology, Wiley Interscience , 
§§16.8-16.11 (1987, 1992) . 

In a preferred embodiment, the introduced nucleotide 
sequence will be incorporated into a plasmid or viral vector capable 
of autonomous replication in the recipient host. Any of a wide 

20 variety of vectors may be employed for this purpose. See, e.g., 
Ausubel et al, supra , §§ 1.5, 1.10, 7.1, 7.3, 8.1, 9.6, 9.7, 13.4, 
16.2, 16.6, and 16.8-16.11. Factors of importance in selecting a 
particular plasmid or viral vector include: the ease with which 
recipient cells that contain the vector may be recognized and 

25 selected from those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired in a particular 
host; and whether it is desirable to be able to "shuttle" the vector 
between host cells of different species. 

Preferred prokaryotic vectors known in the art include 

3 0 plasmids such as those capable of replication in E . coli (such as, 
for example, pBR322, ColEl, pSClOl, pACYC 184, ttVX) . Such plasmids 
are, for example, disclosed by Maniatis, T., et al . (Afolecular 
Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor 
Press, Cold Spring Harbor, NY (1989); Ausubel et al, eds., Current 

35 Protocols in Molecular Biology, Wiley Interscience, New York, NY 
(1987, 1992)). Bacillus plasmids include pC194, pC221, pT127, etc. 
Such plasmids are disclosed by Gryczan, T. (In: The Molecular 
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Biology of the Bacilli, Academic Press, NY (1982), pp. 307-329). 
Suitable Streptomyces plasmids include pIJlOl (Kendall, K.J. , et al . , 
J". Bacteriol . 169:4177-4183 (1987)), and streptomyces bacteriophages 
such as </>C31 (Chater, K.F., et al . , In: Sixth International 
5 Symposium on Actinomycetales Biology, Akademiai Kaido, Budapest, 
Hungary (1986), pp. 45-54). Pseudomonas plasmids are reviewed by 
John, J.F., et al. (JRev. Infect. Dis. 8:693-704 (1986)), and Izaki, 
K. (Jpn. J. Bacteriol. 31:729-742 (1978); and Ausubel et al, supra ) . 

The expressed protein may be isolated and purified in 
10 accordance with conventional conditions, such as extraction, 
precipitation, chromatography , affinity chromatography , 
electrophoresis, or the like. For example, the cells may be 

r**s*\ 1 V^, , „^..,-;*_'u . - * . t-» n ~ w. . -cc a a «_ 1 

protein isolated by column chromatography, for example, on 

15 DEAE- cellulose , phosphocellulose , polyribocytidylic acid- agarose , 
hydroxyapatite or by electrophoresis or immunoprecipitation . 
Alternatively, the transmembrane polypeptide or functional derivative 
thereof may be isolated by the use of anti- transmembrane polypeptide 
antibodies. Such antibodies may be obtained by well-known methods, 

2 0 some of which are mentioned below. These antibodies may be 
immobilized on cellulose, agarose, hollow fibers, or cellulose 
filters by covalent chemical derivatives by methods well known to 
those skilled in the art. 

As discussed herein, GPR polypeptides of the present 

25 invention may be further modified for purposes of drug design, such 
as for example to reduce immunogenicity , to prevent solubility and/or 
enhance delivery, or to prevent clearance or degradation. 

Appropriate modification of the primary amino acid sequence 
of GPR polypeptides of the present invention, obtained by mutagenesis 

30 or utilizing fragments of other related forms of G-protein 
transmembrane proteins, as described herein, will allow the creation 
of molecules which bind G-protein coupled receptors with higher 
affinity than that exhibited by naturally occurring transmembrane 
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advantages over larger polypeptides. These advantages include (1) 
greater stability and dif fusibility , and (2) less immunogenicity . 

Since polypeptides according to the present invention are 
generally small (10-40, 20-30, 15-25, 30-45 amino acids), cell or 
5 tissue sources of G-protein coupled receptors are not required to 
practice the present invention, since known polypeptide syntheses 
steps can be used without undue experimentation to provide GPR 
polypeptides or sequences substantially corresponding thereto. 
Pharmaceutical Preparations 

10 Preparations of GPR polypeptides for parenteral 

administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions, which may contain auxiliary agents or 
excipients which are known in the art. Pharmaceutical compositions 
such as tablets and capsules can also be prepared according to 

15 routine methods. 

By the term "protection" from infection or disease as used 
herein is intended "prevention," "suppression" or "treatment." 
"Prevention" involves administration of a GPR polypeptide, 
polypeptide derivative, or anti- idiotypic antibody prior to the 

20 induction of the disease. 

"Suppression" involves administration of the composition 
prior to the clinical appearance of the disease. 

"Treatment" involves administration of the protective 
composition after the appearance of the disease. It will be 

25 understood that in human and veterinary medicine, it is not always 
possible to distinguish between "preventing" and "suppressing" since 
the ultimate inductive event or events may be unknown, latent, or the 
patient is not ascertained until well after the occurrence of the 
event or events. Therefore, it is common to use the term 

30 "prophylaxis" as distinct from "treatment" to encompass both 
"preventing" and "suppressing" as defined herein. The term 
"protection," as used herein, is meant to include "prophylaxis." 

At least one GPR polypeptide, antibody or anti - idiotypic 
antibody of the present invention may be administered by any means 

35 that achieve their intended purpose, for example, to treat GPR 
related pathologies, such as psychotic disorders, including 
schizophrenia, by inhibition of binding of Dopamine D : receptors 
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using a GPR polypeptide corresponding to a fragment or consensus 
portion of a dopamine D 2 transmembrane domain ; in the form of a 
pharmaceutical composition . 

For example, administration of such a composition may be by 
5 various parenteral routes such as subcutaneous , intravenous , 
intradermal, intramuscular, intraperitoneal, intranasal, transdermal, 
or buccal routes. Alternatively, or concurrently, administration may 
be by the oral route. Parenteral administration can be by bolus 
injection or by gradual perfusion over time. 

10 A preferred mode of using a GPR pharmaceutical composition 

of the present invention is by intravenous or parenteral application. 

A typical regimen for preventing, suppressing, or treating 
G - protein coupled receptor pat hoi ogxes , such as dopamine receptor 
related schizophrenia , comprises administration of an effective 

15 amount of a GPR polypeptide , consensus sequence , or chemical 
derivative thereof , administered over a period of one or several 
days, up to and including between one week and about 24 months. 

It is understood that the dosage of a GPR polypeptide of 
the present invention administered in vivo or in vi tro will be 

20 dependent upon the age , sex, health , and weight of the recipient , 
kind of concurrent treatment, if any, frequency of treatment, and the 
nature of the effect desired. The ranges of effective doses provided 
below are not intended to limit the inventors and represent preferred 
dose ranges. However, the most preferred dosage will be tailored to 

25 the individual subject, as is understood and determinable by one of 
skill in the art , without undue experimentation . 

The total dose required for each treatment may be 
administered by multiple doses or in a sing] e dose . a GPR 
polypeptide or functional a chemical derivative thereof may be 

30 administered alone or in conjunction with other therapeutics directed 
to GPR related pathologies, such as a the dopamine receptor related 
pathology as a non limiting example, or directed to other symptoms 
of the disease . 
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weight, such 0.05, 0.07, 0.09, O.i, 0.5, 0.7, 0.9, 1, 2, 5, 10, 20, 
25, 30, 40, 45, or 50 mg/kg. 

Preparations for parenteral administration include sterile 
aqueous or non-aqueous solutions, suspensions, and emulsions, which 
5 may contain auxiliary agents or excipients which are known in the 
art. Pharmaceutical compositions such as tablets and capsules can 
also be prepared according to routine methods. 

Pharmaceutical compositions comprising at least one GPR 
polypeptide of the present invention may 

10 include all compositions wherein the GPR polypeptide is contained in 
an amount effective to achieve its intended purpose. In addition to 
the GPR polypeptide, a pharmaceutical composition may contain 
suitable pharmaceutically acceptable carriers, such as comprising 
excipients and auxiliaries which facilitate processing of the active 

15 compounds into preparations which can be used pharmaceutically. 

Pharmaceutical compositions include suitable solutions for 
administration intravenously, subcutaneously , dermally, orally, 
mucosally, rectally or may by injection or orally, and contain from 
about 0.01 to 99 percent, preferably from about 2C to 75 percent of 

20 active component (i.e. the antibody) together with the excipient. 
Pharmaceutical compositions for oral administration include tablets 
and capsules. Compositions which can be administered rectally 
include suppositories . 

Example 1: Synthesis of a G- Protein Transmembrane Polypeptide and 
25 Consensus Polypeptide 

The polypeptides in Figs. 1-5 were synthesized using the 
following procedure and include the following characteristics. 

Peptide I (SEQ ID N0:1) , as shown in Fig. 1, was used as a 
control for hydrophobic interaction alone as the mechanism of binding 

3 0 and was run in parallel with the test polypeptides described below. 
Polypeptide II (SEQ ID NO:2), as shown in Fig. 2, represents a 
membrane- spanning fragment of transmembrane segment III in the 
dopamine D : receptor. This particular fragment was chosen since it 
has been implicated in the j3-adrenergic receptor as having many 

35 residues which are involved in ligand binding interaction. 
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Polypeptide III (SEQ ID NO : 3 ) , as shown in Fig. 3, represents the 
consensus polypeptide which was developed as a model for the dopamine 
D 2 system and polypeptide IV (SEQ ID NO:4), as shown in Fig. 4, is 
a control for length dependence to show how critical the polypeptide 
5 length is in binding studies. Polypeptide V (SEQ ID NO: 5) , as shown 
in Fig. 5, is a consensus sequence of transmembrane domains of 
dopamine receptors Dj and D : . 

The above polypeptides I-V (SEQ ID NOS:l-5), as shown in 
Figs. 1-5, respectively, were synthesized using solid phase synthesis 

10 on a Milligen 9600 polypeptide synthesizer using Fmoc amino acids 
(provided by Milligen/Biosearch) and PAL polystyrene resin 
(Milligen/Biosearch) . Coupling times were 1 hour and the 

polypeptides were cleaved by triflucrcacctic 
acid/phenol/H 2 0/thioanisole/ethanedithiol (82.5:5:5:5:2.5) at room 

15 temperature for 2 hours. The filtrate was collect -d and washed with 
2 iuL of trif luoroacetic acid (TFA) and 1 mL of dichloromethane (DCM) . 
The filtrate was reduced in vacuo to 2 ml in volume and the resulting 
polypeptide was precipitated out by the addition of water. The 
polypeptides were then dissolved in 1, 1, 1, 3 , 3 , 3 -hexaf luoro-2 -propanol 

20 [ (HFIP) Eastman]; lyophilized; and stored at -20°C until 
purification. Polypeptides I-V (SEQ ID N0S:l-5) , were purified using 
reverse -phase HPLC using a preparative Vydac C4 column (Vydac) at 
60°C at a flow rate of 6.0 mL/min with a linear gradient of 0-100% 
B in a 60 min period at a UV detection wavelength of 275 nm. 

25 Due to the highly hydrophobic nature of these polypeptides, 

methanol was used v.-ith 0.1% (W/V) TFA and 0.5% (W/V) HFIP as solvent 
A and 2 -propanol with 0.1% TFA as solvent B, in order to purify these 
polypeptides. Further purification was performed with an analytical 
C4 column (Vydac) with an isocratic gradient of 40% B at a flow rate 

30 of l ml /min. Identity of the polypeptides was confirmed by Fast-atom 
bombardment mass spectrometry and electrospray mass spectrometry and 
amino acid analysis. Stock solutions of polypeptides were made in 
HFIP and stored at -20°- SO °C . 
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(Aldrich) and the wavelength of the CD signal was set using standard 
absorbance peaks of benzene vapor. Polypeptide concentrations were 
determined in a Cary 210 UV spectrophotomer with the absorbance , 
measured at 280 nm. Helical content was estimated using CD signal 
5 intensity according to the method of Chen, et al Biochem . 13:3350- . 
3359 (1974) . This calculation compares the experimental ellipticity 
at 222 nm ([6] 222) ([6]) to a theoretical [6] 222. The theoretical 
[0]222 is empirically adjusted to account for differences in 
polypeptide length and is based on experimental CD data from a series 

10 of proteins with known crystal structures. Since both the curve 
shape and magnitude are important in analysis of a CD spectrum for 
secondary structure contributions, we also considered qualitatively 
the contributions to the spectral shapes from different secondary 
structures using reference curves for poly (L- lysine) . 

15 Fig. 6 shows a CD spectrum of the consensus polypeptide III 

(SEQ ID NO: 3) demonstrating that the polypeptide III is only 
partially helical in a solvent system in which most membrane 
polypeptides are strongly helical. 

Preparation of Small Unilamellar Vesicles . Polypeptides 

2 0 were incorporated into DMPC vesicles at lipid: peptide ratio of 147:1 

in the following manner: polypeptide in HFIP was mixed with 
dimyrystyroyl- phosphatidylcholine (synthetic) (DMPC) in dry 
chloroform and dried to a film with a stream of dry nitrogen at 0°C. 
This residue was then dried further overnight under a vacuum (1 x 10" 2 
25 torr) . The residue was then hydrated in 100 mM NaCl and sonicated 
for a 30-min period under nitrogen at 0°C. The suspension was 
sedimented for a 30-min at 100,000 g (4°C) to remove any residual 
titanium particles and large unilamellar vesicles. The supernatant 
was removed and sedimented once more at 159,000 g for a 45 min period 

3 0 at 4°C. The supernatant in the lower portion was used immediately. H 

This basic procedure has been shown to reliably produce small 
unilamellar vesicles. 

Radioligand Binding Assays. A 0.50 mL volume of 1.00 nM 
[ 3 H] -spiperone (New England specific activity 21.4 Ci/mmol) was added 
35 to assay tubes which contained 0 . 5 mL lipid/peptide supernatant, 0.5 
mL Tris buffer pH 7 . 4 and 0.5 mL of cold drug for a final volume of 
2.0 mL. Nonspecific binding was defined in the presence of i uM of 
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( -O butaclamol or l uM spiperone. Appropriate controls for lipid 
vesicles containing no polypeptide were also run. Assay tubes were 
prepared in triplicate and the mixture was incubated for l h at 25°C. 
Incubation was terminated by filtration through filters presoaked in 
5 0.1% polyethyleneimine (w/v, Sigma) for at least 1 h prior to use. 

Filters were then washed with 6.0 mL of cold 50 mM Tris-HCl 
buffer, pH 7.40. For detection of radioactivity, filters were placed 
in 2 . 0 mL of scintillation fluid (Scint iverse ) and incubated for 24 
h. The activity of the tritium was determined in a Beckman LS 7500 

10 liquid scintillation counter. Specific binding of [ 3 H] -spiperone was 
defined as the difference in binding in the presence and absence of 
unlabeled ( +) butaclamol . 

Fig. 7 shows results of radioligand binding assays 
comparing polypeptide I (SEQ ID N0:1) as a control unit polypeptide 

15 III (SEQ ID NO: 3) according to the present invention. Polypeptide 
III (SEQ ID NO: 3) is shown to unexpectedly provide receptor- like 
functional binding, as demonstrated by binding to the neuroleptic 
agent, spiperone, into a stereoselective, concentration- dependent 
manner. 

20 It has also been demonstrated that as little as 0.1% of a 

GPR polypeptide according to the present invention is able to form 
a receptor- like functional binding site. Thus, a GPR polypeptide of 
the present invention is unexpectedly shown to act both as GPR 
ligands and GPR binding sites. 

25 All references cited herein, including journal articles or 

abstracts, published or corresponding U.S. oi foreign patent 
applications, issued U.S. or foreign patents, or any other 
references, are entirely incorporated by reference herein, including 
all data, tables, figures, and text presented in the cited 

30 references. Additionally, the contents of the references cited 
within the references cited herein are also entirely incorporated by 
reference . 

Reference to known method steps, conventional methods 
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The foregoing description of the specific embodiments will 



so fully reveal the general nature of the invention that others can, 
by applying knowledge within the skill of the art (including the * 
contents of the references cited herein) , readily modify and/or adapt 
5 for various applications such specific embodiments, without undue - 
experimentation, without departing from the generic concept of the 
present invention. Therefore, such adaptations and modifications are 
intended to be comprehended within the meaning and range of 
equivalents of the disclosed embodiments, based on the teaching and 
10 guidance presented herein. It is to be understood that the 
phraseology or terminology herein is for the purpcse of description 
and not of limitation, such that the terminology or phraseology of 
the present specification is to be interpreted by the skilled artisan 
in light of the teachings and guidance presented herein. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



Ci) APPLICANT: Murphy, Randall B. 

Schuster, David I. 

5 (ii) TITLE OF INVENTION: POLYPEPTIDES OF G - COUPLED RECEPTOR PROTEINS, AND 

COMPOSITIONS AND METHODS THEREOF 
(iii) NUMBER OF SEQUENCES: 95 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: B ROWDY AND NEIMARK 
10 (B) STREET: 419 Seventh Street, N.W. 

(C) CITY : Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP : 20004 

15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

( C ) OPERATING SYSTEM : PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

2 0 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/943,236 
<B) FILING DATE: 10-SEP-1992 

(C) CLASSIFICATION: 
(viii) ATTORNEY /AGENT INFORMATION: 
25 (A) NAME: Townsend, Kevin G. 

(B) REGISTRATION NUMBER: 34,033 

(C) REFERENCE /DOCKET NUMBER: MURPHY =2 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-628-5197 

3 0 (B) TELEFAX: 202-73 7-3528 

(C) TELEX: 248633 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 i near 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

4 0 Leu Ser Leu Leu Leu Ser Leu Leu Ser Leu Leu Leu Ser Leu Leu Ser 

15 10 15 

Leu Leu Leu Ser Leu Tyr Tyr Tyr 

20 

(2) INFORMATION FOR SEQ ID NO : 2 : 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

5 0 (ii) MOLECULE TYPE: peptide 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

d) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 26 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Asp Tyr Ala He Phe Val Leu Tyr Ala Ser Ala Trp Leu Ser Phe Asn 
10 1 5 10 15 

Cys Pro Phe He Val Thr Leu Asn He Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Lys Ala Val Val Tyr Ser Ser He Val Ser Phe Tyr Val Phe lie Asp 
15 10 15 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH: 2 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Asp Cys Asp Val Phe Val Phe Val Asp He Met Leu Cys Thr Ala Ser 
15 10 15 

lie Phe Asn Leu Cys Ala He Ser Val Gly Lys 
20 25 

3 5 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Ser Leu Val Leu Leu Leu Phe Ala Asp Phe Ser Ser Met Leu Gly Cys 
15 10 15 

4 5 Met Ala Val Leu He Gly Phe Trp Arg Leu Lys Leu Leu Arg Asn His 

20 25 30 

Val Thr Lys Val lie Ala Cys Phe Cys Ala Thr Ser Phe Cys Lys Asp 
35 40 45 

Phe Pro Ser Thr He Leu Thr Leu Thr Asn Thr Ala Val Asn Gly Gly 
50 50 55 60 

Phe Pro Cys Tyr Leu Tyr Ala lie Val lie Thr Tyr Gly Ser Phe Ala 

65 70 75 80 
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Cys Trp Leu Trp Thr Leu lie Cys Leu Ala lie Ser lie Tyr Met Leu 
85 90 95 

lie Val Lys Arg Glu Pro Glu Pro Glu Leu Phe Glu Lys Tyr Tyr Tyr 
100 105 11.; 

5 Leu Leu Cys Trp Gly Leu Pro Leu lie Ser Thr lie Gly Leu Lys Asn 

115 120 125 

Thr Val Gin Phe Val Gly Asn Trp Cys Trp lie Gly Val Ser Phe Thr 
130 135 140 

Gly Tyr Arg Phe Gly Leu Phe Tyr Pro Phe Leu Phe lie Trp Ala lie 
10 145 150 155 160 

Ser Ala Val Leu Val Gly Leu Thr Ser Arg Tyr Thr Tyr Trp lie His 
165 170 175 

Asn Gly Val Ser Asp Asn Lys Glu Lys His Leu Thr Tyr Gin Phe Lys 
180 185 190 

15 Leu lie Asn Tvr lie lie Val Phe Leu Val Cvs Ttd Val Phe Ala Val 

195 * 200 205 

Val Asn Arg lie Val Asn Gly Leu Asn Trp Pro Pro Ala Leu Asn lie 
210 215 220 

Leu His Thr Tyr Leu Ser Val Ser His Gly Phe Trp Ala Ser Val Thr 
20 225 230 235 240 

Phe lie Tyr Asn Asn Pro Leu Met Trp Arg Tyr Phe Gly Ala Lys lie 
245 250 255 

Leu Thr Val Phe Thr Phe Phe Gly Tyr Phe Thr Asp Val Gin Lys Lys 
260 265 270 

25 Leu Glu Lys Asn Leu Ser Pro Tyr Ser Ser Ser Arg Gly Thr Ser Gly 

275 280 285 

Lys Thr Met Leu Gly His Pro Thr Gly Asp Asp Val Gin Cys Ser Ser 
290 295 300 

Asp Leu Gin Cys Ser Leu Glu Arg His Pro Asn Met Val 
30 305 310 315 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE : amino acid 

3 5 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Val Tyr lie Thr Val Glu Leu Ala lie Ala Val Leu Ala Thr Leu Gly 

40 1 5 10 15 

Asn Val Leu Val Cys Trp Ala Val Trp Leu Asn Ser Asn Leu Asn Val 



5 0 5 5 



Ala Cys His Asn Cys Leu Phe Phe Ala Cys Phe Val Leu Val Leu Thr 
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65 70 75 80 

Gin Ser Ser lie Phe Ser Leu Leu Ala lie Ala lie Asp Arg Tyr He 
85 90 95 

Ala He Arg He Pro Leu Arg Tyr Asn Gly Leu Val Thr Gly Thr Arg 
5 100 105 110 

Ala Lys Gly He He Ala Val Cys Trp Val Leu Ser Phe Ala He Gly 
115 120 125 

Leu Thr Pro Met Leu Gly Trp Asn Asn Cys Ser Gin Pro Lys Glu Gly 
130 135 140 

10 Arg Asn Tyr Ser Gin Gly Cys Gly Glu Gly Gin Val Ala Cys Leu Phe 

145 150 155 160 

Glu Asp Val Val Pro Met Asn Tyr Met Val Tyr Tyr Asn Phe Phe Ala 
165 170 175 

Phe Val Leu Val Pro Leu Leu Leu Val Tyr Leu Arg He Phe Leu Ala 
15 180 185 190 

Ala Arg Arg Gin Leu Lys Gin Met Glu Ser Gin Pro Leu Pro Gly Glu 
195 200 205 

Arg Ala Arg Ser Thr Leu Gin Lys Glu Val His Ala Ala Lys Ser Ala 
210 215 220 

20 He He Val Gly Leu Phe Ala Leu Cys Trp Leu Pro Leu His He He 

225 230 235 240 

Asn Cys Phe Thr Phe Phe Cys Pro Glu Cys Ser His Ala Pro Leu Trp 
245 250 255 

Leu Met Tyr Leu Thr He Val Leu Ser His Thr Asn Ser Trp Asn Pro 
25 260 265 270 

Phe He Tyr Ala Tyr Arg He Arg Glu Phe Arg Gin Thr Phe Arg Lys 
275 280 285 

He He Arg Ser His Val Leu Arg Arg Arg Glu Pro Phe Lys Ala Gly 
290 295 300 

3 0 Gly Thr Ser Ala Arg Ala Leu Ala Ala His Gly Ser Asp Gly Glu Gin 

305 310 315 320 

He Ser Leu Arg Leu Asn Gly His Pro Pro Gly Val Trp Ala Asn Gly 
325 330 335 

Ser Ala Pro His Pro Glu Arg Arg Pro Asn Gly Tyr Thr 
35 340 345 

( 2 ) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ala Tyr lie Gly He Glu Val Leu He Ala Leu Val Ser Val Pro Gly 
45 1 5 10 15 

Trp Leu Val lie Trp Ala Val Lys Val Asn Gin Ala Leu Arg Asp Ala 
20 25 30 
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Thr Phe Cys Phe He Val Ser He Ala Val Ala Asp Val Ala Val Gly 
35 40 45 

Ala Leu Val He Pro Leu Ala He Leu He Asn He Gly Pro Arg Thr 
50 55 60 

5 Tyr Phe His Thr Cys Leu Met Val Ala Cys Pro Val Leu He Leu Thr 

65 70 75 80 

Gin Ser Ser He He Ala Leu Leu Ala He Ala Val Asp Arg Tyr Leu 
85 90 95 

Arg Val Lys He Pro Leu Arg Tyr Lys Thr Val Val Thr Pro Arg Arg 
10 100 105 110 

Ala Ala Val Ala He Ala Gly Cys Trp He Leu Ser Phe Val Val Gly 
115 120 125 

Leu Thr Pro Leu Phe Gly Trp Asn Arg Leu Gly Glu Ala Gin Arg Ala 
130 135 140 

15 Trp Ala Ala Asn Gly Ser Gly Gly Glu Pro Val He Lys Cys Glu Phe 

±45 150 155 160 

Glu Lys Val He Ser Met Glu Tyr Met Val Tyr Phe Asn Phe Phe Val 
165 170 175 

Trp Val Leu Pro Pro Leu Leu Leu Met Val Leu He Tyr Leu Glu Val 
20 180 185 190 

Phe Tyr Leu He Arg Arg Gin Leu Gly Lys Lys Val Ser Ala Ser Ser 
195 200 205 

Gly Asp Pro Gin Lys Tyr Tyr Gly Lys Glu Leu Lys He Ala Lys Ser 
210 215 220 

2 5 Leu Ala Leu He Leu Phe Leu Phe Ala Leu Ser Trp Leu Pro Leu His 

225 230 235 240 

He He Asn Cys He Thr Leu Phe Cys Pro Ser Cys Arg Lys Pro Ser 
245 250 255 

He Leu Met Tyr He Ala He Phe Leu Thr His Gly Asn Ser Ala Met 
30 260 265 270 

Pro He Val Tyr Ala Phe Arg He Gin Lys Phe Arg Val Thr Phe Leu 

275 280 285 

Lys He Trp Asn Asp His Phe Arg Cys Gin Pro Thr Pro Pro Val Asp 
290 295 300 

3 5 Glu Asp Pro Pro Glu Glu Ala Pro His Asp 

305 . 310 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 

4 0 (B) TYPE : amino acid 

(C) STRANDEDNESS : single 



Thr Gly Asn Leu Leu Val Leu lie Ser Phe Lys Val Asn Thr Glu Leu 
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20 25 30 

Lys Thr Val Asn Asn Tyr Phe Leu Leu Ser lie Ala Cys Ala Asp Leu 
35 40 45 

He He Gly Thr Phe Ser Met Leu Tyr Leu Leu Met His Trp Ala Leu 

5 50 55 60 

Gly Thr Leu Ala Cys Asp Leu Trp Leu Ala Leu Asp Tyr Val Ala Ser 
65 70 75 80 

Asn Ala Ser Val Leu Asn Leu Leu Leu He Ser Phe Asp Arg Tyr Phe 
85 90 95 

10 Ser Val Thr Arg Pro Leu Ser Tyr Arg Ala Lys Arg Thr Pro Arg Arg 

100 105 110 

Ala Ala He Met lie Gly He Ala Trp Leu Val Ser Phe Val Leu Trp 
115 120 125 

Ala Pro Ala He Leu Phe Trp Gin Tyr Leu Val Gly Glu Arg Thr Met 
15 130 135 140 

Leu Ala Gly Gin Cys Tyr He Gin Phe Leu Ser Gin Pro He lie Thr 
145 150 155 160 

Phe Gly Thr Ala Met Ala Ala Phe Tyr Met Pro Val Thr Val Met Thr 
155 170 175 

2 0 Leu Tyr Trp Arg He Tyr Arg Phe Thr Glu Asn Arg Ala Arg Glu Leu 

180 185 190 

Gin Gly Ser Glu Thr Pro Gly Lys Gly Gly Gly Ser Ser Ser Ser Ser 
195 200 205 

Glu Arg Ser Gin Pro Gly Ala Glu Gly Ser Pro Glu Thr Pro Lys Gly 
25 210 215 220 

Gin Lys Pro Arg Gly Lys Glu Leu Ala Lys Arg Lys Thr Phe Ser Leu 
225 230 235 240 

Val Lys Glu Lys Lys Ala Ala Arg Thr Leu Ser Ala He Leu Leu Ala 
245 250 255 

3 0 Phe He Leu Thr Trp Thr Pro Tyr Asn He Met Val Leu Val Ser Thr 

260 265 270 

Phe Cys Lys Asp Cys Val Pro Glu Thr Leu Trp Glu Leu Gly Tyr Trp 
275 280 285 

Leu He Cys Tyr Val Asn Ser Thr He Asn Pro Trp Tyr Ala Leu Cys 
35 290 295 300 

Asn Lys Ala Phe Arg Asp Thr Phe Arg Leu Leu Leu Leu Cys Trp Asp 
305 310 315 320 

Lys Arg Arg Trp Arg Lys lie Pro Lys Arg Pro Gly Ser Val His Arg 
325 330 335 

4 0 Thr Pro Ser Arg Gin Cys 

340 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 
4 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Val Val Phe lie Val Leu Val Ala Gly Ser Leu Ser Leu Val Thr lie 
1 5 10 15 

lie Gly Asn lie Leu Val Met Val Ser lie Lys Val Asn Arg His Tyr 
20 25 30 

Phe Leu Phe Ser lie Ala Cys Ala Asp Leu lie lie Gly Val Phe Ser 
35 40 45 

Met Asn Leu Tyr Thr Leu Tyr Thr Val lie Gly Tyr Trp Pro Leu Gly 
50 55 60 

Pro Val Val Cys Asp Leu Tyr Val Val Ser Asn Ala Ser Val Met Asn 
65 70 75 80 

Leu Leu lie He Ser Phe Asp Arg Tyr Phe Cys Val Thr Lys Pro Leu 
15 85 90 95 

Thr Tyr Pro Vr 1 T.ys Arc* Thr Thr L^s Met Ala Gl" r Met Met II" 
100 " 105 J 110 " 

Ala Ala Trp Val Leu Ser Phe He Leu Trp Ala Pro Ala He Leu Phe 
115 120 125 

2 0 Trp Gin Phe He Val Gly Val Arg Thr Val Glu Asp Gly Glu Cys Tyr 

130 135 140 

He Gin Phe Phe Ser Asn Pro Ala Val Thr Phe Gly Thr Ala He Ala 
145 150 155 160 

Ala Phe Tyr Leu Pro Val He He Met He Val Leu Tyr Trp His He 
25 165 170 175 

Ser Arg Ala Ser Lys Ser Arg He Lys Lys Asp Lys Lys Glu Pro Val 
180 185 190 

Ala Asn Gin Asp Pro Val Ser Pro Ser Leu Val Gin Gly Arg He Val 
195 200 205 



30 



35 



4 0 



Lys Pro Leu Ser Ser Asp Asp Lys He Val Arg Arg Thr Lys Gin Pro 

210 215 220 

Ala Lys Lys Lys Pro Pro Pro Ser Arg Glu Lys Lys Val Thr Arg Thr 

225 230 235 240 

He Ala He Leu Leu Ala Phe He He Thr Trp Ala Pro Tyr Asn Val 

245 250 255 

Met Val Leu He Asn Thr Phe Cys Ala Pro Cys lie Pro Asn Thr Val 

260 265 270 

Trp Arg He Gly Tyr Trp Leu Cys Tyr He Asn Ser Thr He Asn Pro 

275 280 285 

1 A j ^ „yb ftbii Jt\ -i. o. in! r-iit: bVb Liy fc> int ^ne LVS rtiS 

"> o n "* 



i ,' SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 355 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Trp Phe He Ala Phe Leu Thr Gly He Leu Ala Leu Val Thr He He 

15 10 15 

Gly Asn He Leu Val lie Val Ser Phe Lys Val Asn Lys Gin Leu Lys 
20 25 30 

10 Thr Val Asn Asn Tyr Phe Leu Leu Ser Leu Ala Cys Ala Asp Leu lie 

35 40 45 

lie Gly Val lie Ser Met Asn Leu Phe Thr Thr Tyr lie lie Met Asn 
50 55 60 

Arg Trp Ala Leu Gly Asn Thr Ala Cys Asp Leu Trp lie Ala lie Asp 
15 65 70 75 80 

Tyr Val Ala Ser Asn Ala Ser Val Leu Asn Leu Leu Val lie Ser Phe 

85 90 95 

Asp Arg Tyr Phe Ser lie Thr Arg Pro Leu Thr Tyr Arg Ala Lys Arg 
100 105 110 

2 0 Thr Thr Lys Arg Ala Gly Val Met lie Gly Leu Ala Trp Vai lie Ser 

115 120 125 

Phe Val Leu Trp Ala Pro Ala lie Leu Phe Trp Gin Tyr Phe Val Gly 
130 135 140 

Lys Arg Thr Val Pro Pro Gly Glu Cys Phe He Gin Phe Leu Ser Glu 
25 145 150 155 160 

Pro Thr lie Thr Phe Gly Thr Ala lie Ala Ala Phe Tyr Met Pro Val 
165 170 175 

Thr lie Met Arg lie Leu Tyr Trp Arg lie Tyr Lys Glu Thr Glu Lys 
180 185 190 

3 0 Arg Thr Lys Glu Leu Ala Gly Leu Gin Ala Ser Gly Thr Glu Ala Glu 

195 200 205 

Thr Glu Asn Phe Val His Pro Thr Gly Ser Ser Arg Ser Cy* Ser Ser 

210 215 220 

Tyr Glu Leu Gin Gin Gin Lys Arg Phe Ala Leu Lys Thr Arg Ser Gin 
35 225 230 235 240 

lie Thr Lys Arg Lys Leu Leu Val Lys Glu Lys Lys Ala Ala Gin Thr 
245 250 255 

Leu Ser Ala lie Leu Leu Ala Phe lie lie Thr Trp Thr Pro Tyr Asn 
260 265 270 

4 0 lie Met Val Leu Val Asn Thr Phe Cys Asp Ser Cys lie Pro Lys Thr 

275 280 285 

Tyr Trp Asn Leu Gly Gly Tyr Trp Leu Cys Tyr lie Asn Ser Thr Val 
290 295 300 

Asn Pro Val Cys Tyr Ala Leu Cys Asn Lys Thr Phe Arg Thr Thr Phe 
45 305 310 315 320 

Lys Thr Leu Leu Leu Cys Gin Cys Asp Lys Arg Lys Arg Arg Lys Gin 
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325 330 335 

Gin Tyr Gin Gin Arg Gin Ser Val lie Phe His Lys Arg Val Pro Glu 
340 345 350 

Gin Ala Leu 
5 355 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Phe He Ala Thr Val Arg Gly Ser Leu Ser Leu Val Thr Val 
15 1 5 10 15 

Val Gly Asn He Leu Val Met Leu Ser He Lys Val Asn Arg Gin Leu 
20 25 30 

Gin Thr Val Asn Asn Tyr Phe Leu Phe Ser He Ala Cys Ala Asp Leu 
35 40 45 

He He Gly Ala Phe Ser Met Asn Leu Tyr Thr Val Tyr He He Lys 
50 55 60 

Gly Tyr Trp Pro Lau Gly Ala Trp Cys Asp Leu Trp Leu Ala Leu Asp 
65 70 75 80 

Tyr Val Val Ser Asn Ala Ser Val Met Leu Leu He He Ser Phe Asp 
25 85 90 95 

Arg Tyr Phe Cys Val Thr Lys Pro Leu Thr Tyr Pro Ala Arg Arg Thr 
100 105 11^ 

Thr Lys Met Ala Gly He Met He Ala Ala Ala Trp Val Leu Ser Phe 
115 120 125 



20 



30 



35 



40 



Val Leu Trp Ala Pro Ala He Leu Phe Trp Gin Phe Val Val Gly Lvs 
130 135 140 

Arg Thr Val Pro Asp Asn Gin Cys Phe He Gin Phe Leu Ser Asn Pro 
145 150 155 160 

Ala Val Thr Phe Gly Thr Ala He Ala Ala Phe Tyr Leu Pro Val Val 
165 170 175 

He Met He Val Leu Tyr He His He Ser Leu Ala Ser Arg Ser Arg 

ISO 185 190 

Val His Lys His Arg Pro Glu Gly Pro Lys Glu Lys Lys Al* Lys Thr 
195 200 205 

He Ala Phe Leu ^ys Ser Pro He Met Gin Ser Val Lys Lys Pro Pro 
21° 215 220 



Pro Gly Glu Ai a Lyq Phe Al a Ser lie Ala Ara Ann ^ ■ 



v'd i M '..J : ,V 



-a He Leu Leu Ala Phe He Leu Thr Trr; Thr Pro Tvr Asn Val Mer 
260 265 " J 270 
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Val Leu Val Asn Thr Phe Cys Gin Ser Cys He Pro Asp Thr Val Trp 
275 280 285 

Ser He Gly Tyr Trp Leu He Cys Tyr Val Asn Ser Thr He Asn Pro 
290 295 300 

5 Ala Cys Tyr Ala Leu Cys Asn Ala Thr Phe Lys Lvs Thr Phe Arg His 

305 310 315 ' 320 

Leu Leu Leu Cys Gin Arg Tyr Asn He Gly Thr Ala Arg 
325 330 

(2) INFORMATION FOR SEQ ID NO: 13: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 

Val He Thr He Ala Val Val Thr Ala Val Val Ser Leu Met Thr He 

1 5 10 15 

Val Gly Asn Val Leu Val Met He Ser Phe Lys Val Asn Ser Gin Leu 
20 20 25 30 

Lys Thr Val Asn Asn Tyr Tyr Leu Leu Ser lie Ala Cys Ala Asp Leu 
35 40 45 

He He Gly He Phe Ser Met Asn Leu Tyr Thr Thr Tyr He Leu He 
50 55 60 

2 5 Met Gly Arg Trp Ala Leu Gly Ser Leu Ala Cys Asp Leu Trp Leu Ala 

65 70 75 80 

He Asp Tyr Val Ala Ser Asn Ala Ser Val Leu Asn Leu Leu Val lie 

85 90 95 



30 



Ser Phe Asp Arg Tyr Phe Ser He Thr Arg Pro Leu Thr Tyr Arg Ala 
100 105 110 

Lys Arg Thr Pro Lys Arg Ala Gly lie Met lie Gly lie Ala Trp Leu 
115 120 125 

lie Ser Phe lie Leu Trp Ala Pro Ala lie Leu Cys Trp Gin Tyr Leu 
130 135 140 

35 Val G1 Y Lys Arg Thr Val Pro lie Asp Glu Cys Gin lie Gin Phe Leu 

145 150 155 160 

Ser Glu Pro Thr lie Thr Phe Gly Thr Ala lie Ala Ala Phe Tyr lie 
165 170 175 

Pro v al Ser lie Met Arg lie Leu Tyr Cys Arg lie Tyr Arg Glu Thr 
40 180 185 190 

Glu Lys Arg Thr Lys Asp Leu Ala Asp Leu Gin Gly Ser Asp Ser Val 
195 200 205 

Tyr Lys Ala Glu Lys Arg Lys Pro Ala His Arg Ala Leu Phe Arg Ser 
210 215 220 

45 c y s L eu Arg Cys Pro Arg Pro Thr Lys Gly Leu Asn Pro Asn Pro Ser 

225 230 235 240 

His Gin Met Thr lys Arg Lys Arg Met Ser Leu Val Lys Glu Arg Lys 
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245 250 255 

Ala Ala Gin Thr Leu Ser Ala lie Leu Leu Ala Phe lie lie Thr Trp 
260 265 27'> 

Thr Pro Tyr Asn He Met Val Leu Val Ser Thr Phe Cys Asp Lys Cys 
275 280 285 

Val Pro Val Thr Leu Trp His Leu Gly Tyr Trp Leu Cys Tyr He Asn 
290 295 300 

Ser Thr Val Asn Pro He Cys Tyr Ala Leu Cys Asn Arg Thr Phe Arg 
305 310 315 320 

Lys Thr Phe He Met Leu Leu Cys Arg Trp Lys Lys Lys Lys Val Glu 
325 330 335 

Glu Lys Leu Tyr Trp Gin Gly Asn Ser Lys Leu Pro 
340 345 

(2) INFORMATION FOR SEQ ID NO: 14: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

2 0 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Thr Ala Gly Asp Cys Leu He Met Leu He Val Leu Leu He Val Ala 
15 10 15 

Gly Asn Val Leu Val He Val Ala He Ala Lys Thr Pro Arg Leu Gin 
25 20 25 30 

Thr Leu Thr Asn Leu Phe He Met Ser He Ala Ser Ala Asp Leu Val 
35 40 45 

Met Leu Leu Leu Val Val Pro Phe Cys Ala Thr Leu Val Val Trp Gly 
50 55 60 

3 0 Arg Trp Glu Tyr Gly Ser Phe Phe Cys Glu Leu Trp Thr Ser Val Asp 

65 70 75 80 

Val Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Val He Ala Leu 
85 90 95 

Asp Arg Tyr Leu Ala He Thr Ser Pro Phe Arg Tyr Gin Se- Leu Leu 

35 100 105 110 

Thr Arg Ala Arg Ala Arg Gly Leu Val Cys Thr Val Trp Ala He Ser 
115 120 125 

Ala Leu Val Ser Phe Leu Pro He Leu Leu Ser Asp Glu Ala Arg Arg 
130 135 140 



40 



Cys Tyr Asn Asp Pro Lys Cys Cys Asp Phe Val Thr Asn Arq Ala Tyr 
145 150 155 160 

Ala He Ala Ser Ser Val Val Ser Php Tvr Val Pro Leu Cvc t ] » mo t 



Asp Ser 
195 



Ara Arq Fne Leu 



Gl^ 



Mid 

205 



Ara Pro Pre 
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Ser Pro Ser Pro Ser Pro Val Pro Ala Pro Ala Pro Pro Gly Pro Pro 
210 215 220 

Arg Pro Ala Ala Ala Ala Ala Thr Ala Pro Leu Ala Asn Glv Arg Ala 
225 230 235 240 

5 Gly Lys Arg Arg Pro Ser Arg Leu Val Ala Leu Arg Glu Gin Lys Ala 

245 250 255 

Leu Lys Thr Leu Gly He He Met Gly Val Phe Thr Leu Cys Trp Leu 
260 265 270 

Pro Phe Phe His Arg Glu Leu Val Pro Asp Arg Leu Phe Val Phe Phe 
!0 275 280 285 

Asn Trp Leu Arg Tyr Ala Asn Ser Ala Phe Asn Pro He He Tyr Cys 

290 295 300 

Arg Ser Pro Asp Phe Arg Lys Ala Phe Gin Gly Leu Leu Cys Cys Ala 
305 310 315 320 

15 Arg Arg Ala Ala Arg Arg Arg His Ala Thr His Gly Asp Arg Pro Arg 

325 330 335 

Ala Ser Gly Cys He Ala Arg Pro Gly Pro Pro Ser Pro Gly Ala Ala 
340 345 350 

Ser Asp Asp Asp Asp Asp Asp Val Val Gly Ala Thr Pro Pro Ala Arg 
20 355 360 365 

Leu Leu Glu Pro Trp Ala Gly Cys Asn 
370 375 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 3 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(xi) MOLECULE TYPE: peptide 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Val Val Gly He Val Met Ser Leu lie Val Leu Ala He Val Phe Gly 
1 5 10 15 

Asn Val Leu Val lie Thr Ala lie Ala Lys Phe Glu Arg Leu Gin Thr 
20 25 30 

3 5 Val Thr Asn Tyr Phe lie Thr Ser lie Ala Cys Ala Asp Leu Val Met 

35 40 45 

Gly Leu Ala Val Val Pro Phe Gly Ala Ala His lie Leu Met Lys Met 
50 55 60 

Trp Thr Phe Gly Asn Phe Trp Cys Glu Phe Trp Thr Ser lie Asp Val 

40 65 70 75 80 

Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Val He Ale. Val Asp 
85 90 95 

Arg Tyr Phe Ala He Thr Ser Pro Phe Lys Tyr Gin Ser Leu Leu Thr 
100 105 110 

4 5 Lys Asn Lys Ala Arg Val lie lie lie Met Val Trp lie Val Ser Gly 

115 120 125 

Leu Thr Ser Phe Leu Pro lie Leu Tyr Arg Ala Thr His Gin Glu Ala 
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130 135 140 

He Asn Cys Tyr Ala Asn Glu Thr Cys Cys Asp Phe Phe Thr Asn Gin 
145 150 155 160 

Ala Tyr Ala Ala Ser Ser Ala Val Ser Phe Tyr Val Pro Leu Val He 
5 165 170 175 

Met Val Phe Val Tyr Ser Arg Val Phe Gin Glu Ala Lys Arg Gin Leu 
180 185 190 

Gin Lys He Asp Lys Ser Glu Gly Arg Phe He Phe Val Gin Asn Leu 
195 200 205 

10 Ser Gin Val Glu Gin Asp Gly Arg Thr Gly His Gly Leu Arg Arg Ser 

210 215 220 

Ser Lys Phe Cys Leu Lys Glu His Lys Ala Leu Lys Thr Leu Gly He 
225 230 235 240 

He Pro Cys Thr Phe Thr Leu Cys Trp Leu Pro Phe Phe He Val Asn 
15 245 250 255 

He Val Val He Gin Asp Asn Leu He Arg Lys Glu Val Tyr He Leu 
260 265 270 

Leu Asn Trp He Gly Tyr Val Asn Ser Gly Phe Asn Pro Leu He Tyr 
275 280 285 

2 0 Cys Arg Ser Pro Asp Phe Arg He Ala Phe Gin Glu Leu Leu Cys Leu 

290 295 300 

Arg Arg Ser Ser Leu Lys Ala Tyr Gly Asn Gly Tyr Ser Ser Asn Gly 
305 310 315 320 

Asn Thr Gly Glu Gin Ser Gly Tyr His Val Glu Gin Glu Lys Glu Asn 
25 325 330 335 

Lys Leu Leu Cys Glu Asp Leu Pro Gly Thr Glu Asp Phe Val Gly His 
340 345 350 

Gin Gly Thr Val Pro Ser Asp Asn He Asp 
355 360 

3 0 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

3 5 ( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Ala Leu Ala Gly Ala Leu Leu Ala Leu Ala Val Leu Ala Thr Val 

15 10 15 

4 0 Gly Gly Asn Leu Leu Val He Val Ala He Ala Trp Thr Pro Arg Leu 

20 25 30 



;in Thr Met T 



Trp Pre Leu Gly Ala Thr Glv Cys Glu Leu Trp Thr Ser Val Asp Val 
65" ^ 70 75" 80 
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Leu Cys Val Thr Ala Ser lie Glu Thr Leu Cys Ala lie Ala Val Asp 
85 90 95 

Arg Tyr Leu Ala Val Thr Asn Pro Leu Arg Tyr Gly Ala Leu Val Thr 
100 105 110 

5 Lys Arg Cys Ala Arg Thr Ala Trp Leu Val Trp Val Val Ser Ala Ala 

115 120 125 

Val Ser Phe Ala Pro lie Met Ser Gin Trp Trp Arg Val Gly Ala Asp 
130 135 140 

Ala Glu Ala Gin Arg Cys His Ser Asn Pro Arg Cys Cys Ala Phe Ala 
10 145 150 155 160 

Ser Asn Met Pro Tyr Ala Val Leu Leu Ser Ser Ser Val Ser Phe Tyr 
165 170 175 

Leu Pro Leu Leu Leu Phe Val Tyr Ala Arg Val Phe Trp Ala Thr Arg 
180 185 190 

15 Gin Leu Arg Leu Leu Arg Gly Glu Leu Gly Arg Phe Pro Pro Glu Glu 

195 200 205 

Ser Pro Pro Ala Pro Ser Arg Ser Leu Ala Pro Ala Pro Val Gly Thr 
210 215 220 

Gly Ala Pro Pro Glu Gly Val Pro Ala Cys Gly Arg Pro Pro Ala Arg 
20 225 230 235 240 

Leu lie Pro lie Arg Glu His Arg Ala Leu Cys Thr Leu Gly Leu lie 
245 250 255 

Met Gly Thr Phe Thr Leu Cys Trp Leu Pro Phe Phe lie Ala Asn Val 
260 265 270 

25 Leu Arg Ala Leu Gly Gly Pro Ser Leu Val Pro Gly Pro Ala Phe Leu 

275 280 285 

Ala Leu Asn Trp Leu lie Gly Tyr Ala Asn Ser Ala Phe Asn Pro Leu 
290 295 300 

lie Tyr Cys Arg Ser Pro Asp Phe Arg Ser Ala Phe Arg Arg Leu Leu 

30 305 310 315 320 

Cys Arg Cys Gly Arg Arg Leu Pro Pro Glu Pro Cys Ala Ala Ala Arg 
325 330 335 

Pro Ala Leu Phe Pro Ser Gly Val Pro Ala Ala Glu Ser Ser Pro Ala 
340 345 350 

3 5 Gin Pro Arg Leu Cys Gin Arg Leu Asp Gly 

355 360 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

4 0 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
45 Ala He Leu Leu Gly Val He Leu Gly Gly Leu He Leu Phe Gly Val 

15 10 15 

Leu Gly Asn He Leu Val lie Leu Ser Val Ala Cys His Arg His Leu 
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20 25 30 

His Ser Val Thr His Tyr Tyr lie Val Asn Leu Ala Val Ala Asp Leu 
35 40 45 

Leu Leu Thr Ser Thr Val Leu Pro Phe Ser Ala lie Phe Glu lie Leu 
5 50 55 60 

Gly Tyr Trp Lys Phe Gly Arg Val Phe Cys Asn Val Trp Ala Ala Val 
65 70 75 80 

Asp Val Leu Cys Cys Thr Ala Ser lie Met Leu Leu Cys lie lie Ser 
85 90 95 

10 lie Asp Arg Tyr lie Gly Val Ser Tyr Pro Leu Arg Tyr Pro Thr lie 

100 105 110 

Val Thr Gin Lys Arg Gly Leu Met Ala Leu Leu Cys Val Trp Ala Leu 
115 120 125 

Ser Leu Val lie Ser lie Gly Pro Leu Phe Gly Trp Arg Gin Pro Ala 
15 130 135 140 

Pro Glu Asp Glu Thr lie Cys Gin lie Asn Glu Glu Fro Gly Tyr Val 
145 150 155 160 

Leu Phe Ser Ala Leu Gly Ser Phe Tyr Val Pro Leu Thr lie lie Leu 
165 170 175 

2 0 Val Met Tyr Cys Arg Val Tyr Val Val Ala Lys Arg Glu Ser Arg Gly 

180 185 19'1 

Leu Lys Ser Gly Leu Lys Thr Asp Lys Ser Asp Ser Glu Gin Val Thr 
195 200 205 

Leu Arg lie His Arg Lys Asn Ala Gin Val Gly Gly Ser Gly Val Thr 
25 210 215 220 

Ser Ala Lys Asn Lys Thr His Phe Ser Val Arg Leu Leu Lys Phe Ser 
225 230 235 240 

Arg Glu Lys Lys Ala Ala Lys Thr Leu Gly lie Val Val Gly Cys Phe 
245 250 255 

3 0 Val Leu Cys Trp Leu Pro Phe Phe Leu Val Met Pro lie Gly Ser Phe 

260 265 270 

Phe Pro Asp Phe Arg Pro Ser Glu Thr Val Phe Lys lie Ala Phe Trp 
275 280 285 

Leu Gly Tyr lie Asn Ser Cys lie Asn Pro lie lie Tyr Pro Cys Ser 
35 290 295 300 

Ser Gin Glu Phe Lys Lys Ala Phe Gin Asn Val Leu Arg lie Gin Cys 

305 310 315 320 

Leu Arg Arg Lys Gin Ser Ser Lys His Thr Leu Gly Tyr Thr Leu His 
325 330 335 

4 0 Ala. Pro Ser His Val Leu Glu Gly Gin His Lys Asp Leu Val Arcj lie 



Val Cvs Glu Trp Lys He Phe 
3 7 0 3 7 5 
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(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 70 ammo acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala lie Ser Val Gly Leu Val Leu Gly Ala Phe lie Leu Phe Ala lie 
10 1 5 10 15 

Val Gly Asn lie Leu Val lie Leu Ser Val Ala Cys Asn Arg His Leu 
20 25 30 

Arg Thr Pro Thr Asn Tyr Phe lie Val Asn lie Ala lie Ala Asp Leu 
35 40 45 

15 Leu Leu Ser Phe Thr Val Leu Pro Phe Ser Ala Thr Leu Glu Val Leu 

50 55 60 

Gly Tyr Trp Val Leu Gly Arg lie Phe Cys Asp He Trp Ala Ala Val 
65 70 75 80 

Asp Val Leu Cys Cys Thr Ala Ser He Leu Ser Leu Cys Ala He Ser 
20 85 90 95 

He Asp Arg Tyr He Gly Val Arg Tyr Ser Leu Gin Tyr Pro Thr Leu 
100 105 110 

Val Thr Arg Arg Tyr Ala He He Ala Leu Leu Ser Val Trp Val Leu 
115 120 125 

25 Ser Thr Val He Ser He Gly Pro Leu Leu Gly Trp Lys Glu Pro Ala 

130 135 140 

Pro Asn Asp Asp Lys Glu Cys Val Thr Glu Glu Pro Phe Leu Phe Cys 
145 150 155 160 

Ser Leu Gly Ser Phe Tyr He Pro He Ala Val He Leu Val Met Tyr 
30 165 170 175 

Cys Arg Val Tyr He Val Ala Lys Arg Thr Thr Lys Asn Leu Glu Ala 
180 185 190 

Gly Val Met Lys Glu Met Ser Asn Ser Lys Phe Leu Thr Leu Arg He 
195 200 205 

3 5 His Trp Ser Lys Asn Phe His Glu Asp Thr Leu Ser Ser Thr Lys Ala 

210 215 220 

Lys Gly His Asn Pro Arg Ser Ser He Ala Val Lys Leu Phe Lvs Phe 
225 230 235 " 240 

Ser Arg Glu Lys Lys Ala Ala Lys Thr Leu Gly He Val Val Gly Trp 
40 245 250 255 

He Leu Cys Trp Leu Pro Phe Phe He Ala Leu Pro Leu Gly Ser Leu 
260 265 270 

Phe Ser Thr Leu Lys Pro Pro Asp Ala Val Phe Lys Trp Phe Trp Leu 
275 280 285 

45 Gly Tyr Phe Asn Ser Cys Leu Asn Pro He He Tyr Pro Cys Ser Ser 

290 295 300 

Lys Glu Phe Lys Arg Ala Leu Leu Gly Cys Gin Cys Arg Gly Gly Arg 



BNSDOCID: <WO 9405695A1 I > 



WO 94/05695 PCT/US93/08528 

- 71 - 

305 310 315 320 

Arg Arg Arg Arg Arg Arg Arg Leu Ala Cys Ala Tyr Thr Tyr Arg Pro 
325 330 335 

Trp Thr Arg Gly Gly Ser Leu Glu Arg Ser Gin Ser Arg Lys Asp Ser 
5 340 345 350 

lie Asp Asp Ser Gly Ser Cys Met Ser Gly Gin Lys Arg Thr Leu Pro 
355 360 365 

Ser Ala 

370 

10 (2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

Vdi >^j.d *jiy j_itiU i-i.J.d >Lj.ci vai vcij. uiy truer ucu 11c vox cue mi. vqj. 

15 10 15 

2 0 Val Gly Asn Val Leu Val Val He Ala Val Leu Thr Ser Arg Ala Leu 

20 25 30 

Arg Ala Pro Gin Asn Leu Phe Leu Val Ser He Ala Ser Ala Asp He 
35 40 45 

Leu Val Ala Thr Leu Val Met Pro Phe Ser Leu Ala Asn Glu He Met 
25 50 55 60 

Tyr Trp Tyr Phe Gly Gin Val Trp Cys Gly Val Tyr Leu Ala He Asp 
65 70 75 80 

Val Leu Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser Leu 
85 90 95 

3 0 Asp Arg Tyr Trp Ser Val Thr Gin Ala Val Glu Tyr Asn Leu Lys Arg 

100 105 110 

Thr Pro Arg Arg Val Lys Ala Thr He Val Ala Val Trp Leu He Ser 
115 120 125 

Ala Val He Ser Phe Pro Pro Leu Val Ser Leu Tyr Arg Gin Pro Asp 
35 130 135 140 

Gly Ala Ala Tyr Pro Gin Cys Gly Leu Asn Asp Glu Thr Trp Tyr He 
145 150 155 160 

Leu Ser Ser Cys He Gly Ser Phe Phe Ala Pro Cys Leu Ha Tyr Leu 
165 170 175 

4 0 Leu Val Tyr Ala Arg He Tyr Arg Val Ala Lys Arg Arg Thr Arg Thr 

180 185 190 

Leu Ser Glu Lys Arg Ala Pro Val Gly Pro Asp Gly Ala Ser Pro Thr 

195 200 205 



Arc Lys Val Ala Gin Ala Arc 



blu Lys Arg Phe Thr Phe Val Leu Ala 
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245 250 255 

Leu Val Phe Val Leu Cys Trp Phe Pro Phe Phe Phe lie Tyr Ser Leu 
260 265 270 

Tyr Gly lie Cys Arg Glu Ala Cys Gin Val Pro Gly Pro Leu Phe Lys 
5 275 280 285 

Phe Phe Phe Trp lie Gly Tyr Cys Asn Ser Ser Leu Asn Pro Val lie 
290 295 300 

Tyr Thr Val Phe Asn Gin Asp Phe Arg Pro Ser Phe Lys His lie Leu 
305 310 315 320 

10 Phe Arg Arg Arg Arg Arg Gly Phe Arg Gin 

325 330 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 0 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

2 0 Thr Ala Ala lie Ala Ala Ala lie Thr Phe Leu lie Leu Phe Thr lie 

15 10 15 

Phe Gly Asn Ala Leu Val lie lie Ala Val Leu Thr Ser Arg Ser Leu 
20 25 30 

Arg Ala Pro Gin Asn Leu Phe Leu Val Ser lie Ala Ala Ala Asp lie 
25 35 40 45 

Leu Val Ala Thr Leu lie lie Pro Phe Ser Leu Ala Asn Glu Leu Leu 

50 55 60 

Gly Tyr Trp Tyr Phe Arg Arg Thr Trp Cys Glu Val Tyr Leu Ala Leu 
65 70 75 80 

3 0 Asp Val Leu Phe Cys Thr Ser Ser lie Val His Leu Cys Ala lie Ser 

85 90 95 

Leu Asp Arg Tyr Trp Ala Val Ser Arg Ala Leu Glu Tyr Asn Ser Lys 
100 105 110 

Arg Thr Pro Arg Arg lie Lys Cys He He Leu Thr Val Trp Leu He 
35 115 120 125 

Ala Ala Val He Ser Leu Pro Pro Leu He Tyr Lys Gly Asp Gin Gly 
130 135 140 

Pro Gin Pro Arg Gly Arg Pro Gin Cys Lys Leu Asn Gin Glu Ala Trp 

145 150 155 160 

40 Tyr He Leu Ser Ser He Gly Ser Phe Phe Ala Pro Cys Leu He Leu 

165 170 175 

Leu Val Tyr Leu Arg He Tyr Leu He Ala Lys Arg Ser Asn Arg Arg 
180 185 190 

Gly Pro Arg Ala Lys Cys Gly Pro Gly Gin Gly Glu Ser Lys Gin Pro 
45 195 200 205 

Arg Pro Asp His Gly Gly Ala He Ala Ser Ala Lys Leu Pro Ala He 
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210 215 220 

Ala Ser Gly Arg Gly Val Gly Ala lie Gly Gly Gin Trp Trp Arg Arg 
225 230 235 240 

Arg Ala His Val Thr Arg Glu Lys Arg Phe Thr Phe Val Leu Ala Val 
5 245 250 255 

Val lie Gly Val Phe Val Leu Cys Trp Phe Pro Phe Phe Phe Ser Tyr 
260 265 270 

Ser Leu Gly Ala lie Cys Pro Lys His Cys Lys Val Pro His Gly Leu 
275 280 285 

10 Phe Gin Phe Phe Phe Trp lie Gly Tyr Cys Asn Ser Ser Leu Asn Pro 

290 295 300 

Val lie Tyr Thr lie Phe Asn Gin Asp Phe Arg Met Phe Arg Arg lie 
305 310 315 320 

Leu Cys Arg Pro Trp Thr Gin Thr Ala Trp 
15 325 330 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 0 amino acids 

(B) TYPE: amino acid 

2 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Thr Leu Thr Leu Val Cys lie Ala Cys Leu Ser Leu Thr Val Phe Gly 
25 1 5 10 15 

Asn Val Leu Val lie lie Ala Val Phe Thr Ser Arg Ala Leu Lys Ala 
20 25 30 

Pro Gin Asn Leu Phe Leu Val Ser lie Ala Ser Ala Asp lie Leu Val 
35 40 45 

3 0 Ala Thr Leu Val lie Pro Phe Ser Leu Ala Asn Glu Val Asn Gly Tyr 

50 55 60 

Trp Tyr Phe Gly Lys Trp Cys Glu lie Tyr Leu Ala Leu Asp Val Leu 
65 70 75 80 

Phe Cys Thr Ser Ser lie Val His Leu Cys Ala lie Ser Leu Asp Arg 
35 85 90 95 

Tyr Trp Ser lie Thr Gin Ala lie Glu Tyr Asn Leu Lys Arg Thr Pro 
100 105 110 

Arg Arg lie Lys Ala lie lie lie Thr Val Trp Val lie Ser Ala Val 
115 120 125 

40 He Ser Phe Pro Pro Leu He Ser lie Glu Lys Lys Gly Gly Gly Gly 

130 135 140 

Giv Pro Gin Pro Ala Glu Pro Arc: fvs Glu He Asn Asn Gin Lys Tm 



180 



Gin I ; 
185 



^rq nrg 

19C 
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Val Pro Pro Ser Arg Arg Asp Pro Asp Ala Val Ala Ala Pre Pro Gly 
195 200 205 

Gly Thr Glu Arg Arg Pro Asn Gly Leu Gly Pro Glu Arg Ser Ala Gly 
210 215 220 

5 Pro Gly Gly Gly Arg Gly Arg Ser Ala Ser Gly Leu Pro Arg Arg Arg 

225 230 235 240 

Ala Gly Ala Gly Gly Gin Asn Arg Glu Lys Arg Phe Thr Phe Val lie 
245 250 255 

Ala Val Val lie Gly Val Phe Val Val Cys Trp Phe Pro Phe Phe Phe 
10 260 265 270 

Thr Tyr Thr Leu Thr Ala Val Leu Cys Ser Val Pro Arg Thr Leu Phe 
275 280 285 

Lys Phe Phe Phe Trp Phe Gly Tyr Cys Asn Ser Ser Leu Asn Pro Val 
290 295 300 

15 lie Tyr Thr lie Phe Asn His Asp Phe Arg Arg Ala Phe Lys Lys lie 

305 310 315 320 

Leu Cys Arg Gly Asp Arg Lys Arg lie Val 
325 330 

<2) INFORMATION FOR SEQ ID NO: 22: 
2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

2 5 <ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Thr Leu Thr Leu Val Cys lie Ala Gly Leu lie Met Leu Phe Thr Val 

15 10 15 

Phe Gly Asn Val Leu Val lie lie Ala Val Phe Thr Ser Arg Ala Leu 

30 20 25 30 

Lys Ala Pro Gin Asn Leu Phe Leu Val Ser lie Ala Ser Ala Asp lie 
35 40 45 

Leu Val Ala Thr Leu Val lie Pro Phe Ser Leu Ala Asn Glu Val Met 
50 55 60 

3 5 Tyr Trp Tyr Phe Gly Lys Val Trp Cys Glu lie Tyr Leu Ala lie Asp 

65 70 75 80 

Val Leu Phe Cys Thr Ser Ser lie Val His Leu Cys Ala lie Ser Leu 
85 90 95 

Asp Arg Tyr Trp Ser lie Thr Gin Ala lie Glu Tyr Asn Leu Lys Arg 
40 100 105 110 

Thr Pro Arg Arg He Lys Ala He He Val Thr Val Trp Val He Ser 
115 120 125 

Ala Val He Ser Phe Pro Pro Leu Leu He Ser He Glu Lys Lys Gly 
130 135 140 

45 Ala Gly Gly Gly Gin Gin Pro Ala Glu Pro Ser Cys Lys lie Asn Asp 

145 150 155 160 

Gin Lys Trp Tyr Val He Ser Ser Ser He Gly Ser Phe Phe Ala Pro 
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165 1*?0 175 

Cvs Leu He Asn His Leu Val Tvr Val Arg He Tyr Gin He Ala Lys 
180 185 190 

Arg Arg Thr Arg Val Pro Pro Ser Arg Arg Gly Pro Asp Ala Cys Ser 
5 195 200 205 

Ala Pro Pro Gly Gly Ala Asp Arg Arg Pro Asn Ala Val Gly Pro Glu 
210 215 220 

Arg Gly Ala Gly Thr Ala Gly Gly Gin Gly Glu Glu Arg Ala Gly Gly 
225 230 235 240 

10 Ala Lys Ala Ser Arg Trp Arg Gly Arg Gin Asn Arg Glu Lys Arg Phe 

245 250 255 

Thr Phe Val He Ala Val Val He Gly Val Phe Val Val Cys Trp Phe 
260 265 270 

Pro Phe Phe Phe Thr Tyr Thr Leu He Ala Val Gly Cys Pro Val Pro 
15 275 280 285 

Tyr Gin Leu Phe Asn Phe Phe Phe Trp Phe Gly Tyr cys Asn Ser Ser 
290 295 300 

Leu Asn Pro Val He Tyr Thr He Phe Asn His Asp Phe Arg Arg Ala 
305 310 315 320 

20 Phe Lys Lys He Leu Cys Arg Gly Asp Arg Lys Arg He Val 

325 330 

(2) INFORMATION FOR SEQ ID NO:23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 21 amino acids 

2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

3 0 Leu Leu Thr Ala Leu Val Leu Ser Val He He Val Leu Thr He He 

15 10 15 

Gly Asn He Leu Val He Leu Ser Val Phe Thr Tyr Lys Pro Leu Arg 
20 25 30 

He Val Gin Asn Phe Phe He Val Ser He Ala Val Ala Asp Leu Thr 
35 35 40 45 

Val Ala Leu Leu Val Leu Pro Phe Trp Ala Tyr Ser He Leu Gly Arg 
50 55 60 

Trp Glu Phe Gly He His Leu Cys Lys Leu Trp Leu Thr Cys Asp Val 
65 70 75 80 

4 0 Leu Cys Cys Thr Ser Ser He Leu Asn Leu Cys Ala He Ala Leu Asp 

85 90 95 



Leu He Ser Ser I ro Pro Leu lie Gly Trp Asn Asp Trp Pre Asp Glu 
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130 135 140 

Phe Thr Ser Ala Thr Pro Cys Glu Leu Thr Ser Gin Arg lie Gly Tyr 
145 150 155 160 

Val lie Tyr Ser Ser Leu Gly Ser Phe Phe lie Pro lie Ala lie Met 
5 165 170 175 

Arg lie Val Tyr lie Glu lie Phe Val Ala Thr Arg Arg Arg Leu Arg 
180 185 190 

Glu Arg Ala Arg Ala Asn Lys lie Asn Thr He Ala Leu Lys Ser Thr 
195 200 205 

10 Glu Leu Glu Pro Met Ala Asn Ser Ser Pro Val Ala Ala Ser Asn Ser 

210 215 220 

Gly Ser Lys Lys Lys Thr Ser Gly Val Asn Gin Phe He Glu Glu Lys 
225 230 235 240 

Gin Lys lie Ser Leu Ser Lys Glu Arg Arg Ala Ala Arg Thr Leu Gly 
15 245 250 255 

He He Met Val the Val He Cys Trp Leu Pro Phe Phe He Met Tyr 
260 265 270 

Val He Leu Pro Phe Cys Cys Pro Thr Asn Lys Phe Lys Asn Phe He 
275 280 285 

2 0 Thr Trp Leu Gly Tyr He Asn Ser Gly Leu Asn Pro Val He Tyr Thr 

290 295 300 

He Phe Asn Leu Asp Tyr Arg Arg Ala Phe Lys Arg Leu Leu Gly Leu 
305 310 315 320 



25 



Asn 



(2) INFORMATION FOR SEQ ID NO; 24 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Arg He Leu Thr Ala Cys Phe Leu Ser Leu Leu lie Leu Ser Thr Leu 
35 1 5 10 15 

Leu Gly Asn Thr Leu Val Cys Ala Ala Val He Arg Phe Arg His Leu 
20 25 30 

Arg Ser Lys Val Thr Asn Phe Phe Val He Ser Leu Ala Val Ser Asp 
35 40 45 

4 0 Leu Leu Val Ala Val Leu Leu Trp Lys Ala Val Ala Glu lit Ala Gly 

50 55 60 

Phe Trp Pro Phe Gly Ser Phe Cys Asn He Trp Val Ala Phe Asp He 
65 70 75 80 

Met Cys Ser Thr Ala Ser He Leu Asn Leu Cys Val He Ser Val Asp 
45 85 90 95 

Arg Tyr Trp Ala lie Ser Ser Pro Phe Arg Tyr Glu Arg Lys Lys Arg 
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100 105 110 

Pro Lys Ala Ala Phe lie Leu lie Ser Val Ala Trp Thr Leu Ser Val 
115 120 125 

Leu lie Ser Phe lie Pro Val Gin Leu Ser Trp His Lys Ala Lys Pro 
5 130 135 140 

Thr Ser Pro Ser Asp Gly Met Ala Thr Ser Leu Ala Glu Thr lie Asp 
145 150 155 160 

Asn Cys Asp Ser Ser Leu Ser Arg Thr Tyr Ala lie Ser Ser Ser Val 
165 170 175 

10 lie Ser Phe Tyr lie Pro Val Ala lie Leu Val Thr Tyr Thr Arg lie 

180 185 190 

Tyr Arg lie Ala Gin Lys Gin lie Arg Arg lie Ala Ala Leu Glu Arg 
195 200 205 

Ala Ala Val His Ala Lys Asn Cys Gin Gly Asn Lys Pro Val Glu Cys 
15 210 215 220 

Ser Gin Pro Glu Ser Ser Phe Met Ser Phe Lys Arg Glu Thr Lys Val 
225 230 235 240 

Leu Lys Thr Leu Ser Val He Thr Cys Val Phe Val Cys Cys Trp Leu 
245 250 255 

20 Pro Phe Phe He Leu Asn Cys He Leu Pro Phe Cys Gly Se<>- Gly Glu 

260 265 270 

Thr Gin Pro Phe Cys Thr Asp Ser Asn Thr Phe Asp Val Phe Val Trp 
275 280 285 

Phe Gly Trp Ala Asn Ser Ser Leu Asn Pro He He Tyr Ala Phe Asn 
25 290 295 300 

Ala Asp Phe Arg Lys Ala Phe Ser Thr Leu Leu Gly Cys Tyr Arg Leu 
305 310 315 320 

Cys Pro Ala Thr Asn Met Ala He Glu Thr Val Ser lie Asn Asn Gly 
325 330 335 

3 0 Ala Ala Met Phe Ser Ser His His Glu Pro Arg Gly Ser He Ser Lys 

340 345 350 

Glu Cys Asn Leu Val Tyr Leu He Pro His Ala Val Gly Ser Ser Glu 
355 360 365 

Asp Leu Lys Lys Glu 
35 370 

{2} INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 
40 (C) STRANDEDNESS : single 

( D ) TOPOLOGY ; 1 i near 



Glv Asn Val Leu Val Cvs Ala Ala lie Val Arg Sex' Arg Hi^ Leu Leu 
20 ^ 25 3C 
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Val Phe lie Val Ser lie Ala Val Ser Asp Leu Phe Val Ala Leu Leu 
35 40 45 

Val Asn Thr Trp Lys Ala Tyr Ala Glu Val Ala Gly Tyr Trp Pro Phe 
50 55 60 

5 Gly Ala Phe Cys Asp Val Trp Val Ala Phe Asp lie Met Cys Ser Thr 

65 70 75 80 

Ala Ser lie Leu Asn Leu Cys Val lie Ser Val Asp Arg Tyr Trp Ala 
85 90 95 

lie Ser Arg Pro Phe Arg Tyr Lys Ala Leu Val Met Val Gly lie Ala 
10 100 105 11C 

Trp Thr Leu Ser lie Leu lie Ser Phe lie Pro Val Gin lie Asn Trp 
115 120 125 

Asn Arg Asp Gin Ala Ala Ser Trp Gly Gly Leu Asp Leu Pro Asn Asn 
130 135 140 

15 He Asp Cys Asp Ser Ser Leu Asn Arg Thr Tyr Ala He Ser Ser Ser 

145 150 155 160 

Leu He Ser Phe Tyr He Pro Val Ala He Leu Val Thr Tyr Thr Arg 
165 170 175 

He Tyr Arg He Ala Gin Val Gin He Arg Arg He Ser Ser Leu Glu 
20 180 185 190 

Arg Ala Ala Glu His Ala Gin Ser Cys Arg Ser Ser Ala Ala Cys Ala 
195 200 205 

Pro Asp Thr Ser Leu Arg Ala Ser He Lys Lys Glu Thr Lys Val Leu 
210 215 220 

2 5 Lys Thr Leu Ser Val He He Cys Val Phe Val Cys Cys Trp Leu Pro 

225 230 235 240 

Phe Phe He Leu Asn Cys Met Val Pro Phe Cys Ser Gly Hi z Pro Glu 
245 250 255 

Gly Pro Pro Ala Gly Phe Pro Cys Val Ser Glu Thr Thr Phe Asp Val 
30 260 265 270 

Phe Val Trp Phe Gly Trp Ala Asn Ser Ser Leu Asn Pro Val He Tyr 
275 280 285 

Ala Phe Asn Ala Asp Phe Gin Lys Val Phe Ala Gin Leu Leu Cys Ser 
290 295 300 

3 5 His Phe Cys Ser Arg Thr Pro Val Glu Thr Val Asn He Ser Asn Glu 

305 310 315 320 

Leu He Ser Tyr Asn Gin Asp He Val Phe His Lys Glu He Ala Ala 
325 330 335 

Ala Tyr lie His Met Met Pro Asn Ala Val Thr Pro Gly Asn Arg Glu 
40 340 345 350 

Val Asp Asn Asp Glu Glu Glu Gly 
355 360 

(2) INFORMATION FOR SEQ ID NO : 2 6 : 
(l) SEQUENCE CHARACTERISTICS : 

4 5 (A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(11) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 : 
5 Tyr Asn Tyr Tyr Ala Thr Leu Leu Thr Leu Leu lie Ala Va^ lie Val 

15 10 15 

Phe Gly Asn Val Leu Val Cys Met Ala Val Ser Arg Glu Lys Ala Leu 
20 25 30 

Gin Thr Met Asn Tyr Leu lie Val Ser lie Ala Val Ala Asp Leu Leu 
10 35 40 45 

Val Ala Thr Leu Val Trp Trp Trp Tyr Leu Glu Val Val Gly Glu Trp 
50 55 60 

Lys Phe Ser Arg lie His Cys Asp lie Phe Val Thr Leu Asp lie Thr 
65 70 75 80 

15 Ala Ser lie Leu Asn Leu Cys Ala lie Ser lie Asp Arg Tyr Thr Ala 

85 90 95 

Val Ala Met Pro Met Leu Tyr Asn Thr Arg Tyr Ser Ser Lys Arg Arg 
100 105 110 

Val Thr Val Met lie Ser lie Val Trp Val Leu Ser Phe Thr lie Ser 
20 115 120 125 

Cys Pro Leu Leu Phe Gly Leu Asn Asn Ala Asp Gin Asn Glu Cys lie 
130 135 140 

lie Ala Asn Pro Ala Phe Val Val Tyr Ser Ser lie Val Se. Phe Tyr 
145 150 155 160 

2 5 Val Pro Phe lie Val Thr Leu Leu Val Tyr He Lys He Tyr He Val 

165 170 175 

Leu Arg Arg Arg Arg Lys Arg Val Asn Thr Lys Arg Ser Ser Arg Ala 
180 185 190 

Phe Arg Ala His Leu Arg Ala Pro Leu Lys Gly Asn Cys Thr His Pro 
30 195 200 205 

Glu Asp Met Lys Leu Cys Thr Val He Pro Asn Gly Lys Thr Arg Thr 
210 215 220 

Ser Leu Lys Thr Met Ser Arg Arg Lys Leu Ser Gin Gin Lys Glu Lys 
225 230 235 240 

3 5 Lys Ala Thr Gin Met lie Ala He Val Leu Gly Val Phe He He Cys 

245 250 255 

Lys Leu Pro Phe Phe lie Thr His He Leu Asn He His Cys Asp Cys 
260 265 270 

Asn He Pro Pro Val Leu Tyr Ser Ala Phe Thr Trp Leu Gly Tyr Val 

40 275 280 285 



4 5 ;2) INFORMATION FOR SEQ ID NO : 2 7 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 317 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
5 (li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 7 : 

Ala Tyr Tyr Ala Leu Ser Tyr Cys Ala Leu lie Leu Ala lie Val Phe 
15 10 15 

Gly Asn Gly Leu Val Cys Met Ala Val Leu Arg Glu Lys Ala Leu Gin 

10 20 25 30 

Thr Thr Thr Asn Tyr Leu Val Val Ser Leu Ala Val Ala Asp Leu Leu 
35 40 45 

Val Ala Thr Leu Val Trp Trp Val Val Tyr Leu Glu Val Thr Gly Gly 
50 55 60 

15 Val Trp Asn Phe Ser Arg lie Cys Cys Asp Val Phe Val Thr Leu Asp 

65 70 75 80 

Val Met Met Thr Ala Ser lie Leu Asn Leu Cys Ala lie Ser lie Asp 
85 90 95 

Arg Tyr Thr Ala Val His Tyr Gin His Gly Thr Gly Gin Ser Ser Cys 
20 100 105 110 

Arg Arg Val Ala lie Met lie Thr Ala Val Trp Val Leu Ala Phe Ala 
115 120 125 

Val Ser Cys Pro Leu Leu Phe Gly Phe Asn Thr Gly Asp Pro Thr Val 
130 135 140 

2 5 Cys Ser lie Ser Asn Pro Asp Phe Val lie Tyr Ser Ser Val Val Ser 

145 150 155 160 

Phe Tyr Leu Pro Phe Gly Val Thr Val Leu Val Tyr Ala Arg lie Tyr 
165 170 175 

Val Val Leu Lys Gin Arg Arg Arg Lys Arg lie Leu Thr Arg Gin Asn 
30 180 185 190 

Ser Gin Cys Asn Ser Val Arg Pro Gly Phe Pro Gin Gin Ser Thr Ser 
195 200 205 

Leu Pro Asp Pro Ala His Leu Glu Leu Lys Arg Ser Asn Gly Arg Leu 
210 215 220 

3 5 Ser Thr Ser Leu Lys Leu Pro Leu Gin Pro Arg Gly Val Pro Leu Arg 

225 230 235 240 

Glu Lys Lys Ala Thr Gin Met Val Ala lie Val Leu Gly Ala Phe lie 
245 250 255 

Val Cys Trp Leu Pro Phe Phe Leu Thr His Val lie Asn Thr His Cys 
40 260 265 270 

Gin Thr Cys His Val Ser Pro Glu Leu Tyr Ser Ala Thr Thr Trp Leu 
275 280 285 

Gly Tyr Val Asn S«r Ala Leu Asn Pro Val lie Tyr Thr Thr Phe Asn 
290 295 300 

4 5 lie Glu Phe Arg Lys Ala Phe Leu Lys He Leu Ser Cys 

305 310 315 
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(2) INFORMATION FOR SEQ ID NO: 28: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
di) MOLECULE TYPE: peptide 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 

Gly Ala Ala Ala Leu Val Gly Gly Val Leu Leu lie Cys Ala Val Leu 
15 10 15 

Ala Gly Asn Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu 
20 25 30 

Gin Thr Pro Thr Asn Ser Phe lie Val Ser Leu Ala Ala Ala Asp Leu 
35 40 45 

Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val Gin 
50 55 60 

Gly Ala Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Val Met Leu Cys 

65 70 75 80 

Thr Ala Ser lie Phe Asn Leu Cys Ala He Ser Val Asp Ar*- Phe Val 
85 90 95 

Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly Ser Arg Arg 
100 105 110 

Gin Leu Leu Leu He Gly Ala Thr Trp Leu Leu Ser Ala Ala Val Ala 
115 120 125 

Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg Asp Pro Ala 
130 135 140 

Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser Ser Val Cys 
145 150 155 160 

Ser Phe Phe Leu Pro Cys Pro Leu Leu Tyr Trp Ala Thr Phe Arg Gly 
165 170 175 

Leu Gin Leu Val Ala Arg Arg Ala Lys Leu His Gly Arg Ala Pro Arg 
180 185 19U 

Arg Pro Ser Gly Pro Gly Pro Pro Ser Pro Thr Pro Pro Ala Pro Arg 

195 200 205 

Leu Pro Gin Asp Pro Cys Gly Ala Leu Pro Pro Gin Thr Pro Pro Gin 
210 215 220 

Thr Arg Arg Arg Arg Arg Ala Lys He Thr Gly Arg Glu Ar^ Lys Ala 
225 230 235 240 

Met Arg Val Leu Pro Val Val Val Gly Ala Phe He Leu Cys Trp Thr 
245 250 255 

Pro Phe Phe Val Val His He Thr Gin Ala Leu Cys Pre Ala Cys Ser 



2 60 



2 70 



Asn Val Phe Arg Lys Ala Leu Arg Ala Cys 



Cys 
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305 310 315 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
di) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 
10 Lys lie Ser Leu Ala Val Val Leu Ser Val lie Thr Leu Ala Thr Val 

15 10 15 

Leu Ser Asn Ala Phe Val Leu Thr Arg lie Leu Leu Thr Arg Lys Leu 
20 25 30 

His Thr Pro Ala Asn Tyr Leu He Gly Ser He Ala Thr Thr Asp Leu 
15 35 40 45 

Leu Val Ser He Leu Val Trp He Ser lie Ala Tyr Thr He Thr His 
50 55 60 

Thr Trp Asn Phe Gly Gin He Leu Cys Asp He Trp Leu Ser Ser Asp 
65 70 75 80 

2 0 He Thr Cys Cys ^hr Ala Ser He Leu His Leu Cys Val He Ala Leu 

85 90 95 

Asp Arg Tyr Trp Ala He Thr Asp Ala Leu Glu Tyr Ser Lys Arg Arg 
100 105 110 

Thr Ala Gly His Ala Ala Thr Met He Ala He Val Trp Ala He Ser 
25 115 120 125 

He Cys He Ser He Pro Pro Leu Phe Trp Arg Ala Lys Ala Gin Glu 
130 135 140 

Glu Met Ser Asp Cys Leu Val Asn Thr Ser Gin Ser Tyr Thr He Tyr 
145 150 155 160 

3 0 Ser Thr Cys Gly Ala Phe Tyr He Pro Ser Val Leu Leu He He Leu 

165 170 175 

Tyr Gly Arg He Tyr Arg Ala Ala Arg Asn Arg He Leu Asn Pro Pro 
180 185 190 

Ser Leu Tyr Gly Lys Arg Phe Thr Thr Ala His Leu lie Thr Gly Ser 
35 195 200 205 

Ala Gly Ser Ser Leu Cys Ser Leu Asn Ser Ser Leu His Glu Gly His 
210 215 220 

Asn His Val Lys lie Lys Leu Ala Asp Ser Ala Leu Glu Arg Lys Arg 
225 230 235 240 

40 He Ser Ala Ala Arg Glu Arg Lys Ala Thr Lys He Leu Gly lie lie 

245 250 255 

Leu Gly Ala Phe He lie Cys Trp Leu Pro Phe Phe Val Val Ser Leu 
260 265 27- 

Val Leu Pro lie Cys Arg Asp Ser Cys Trp He His Pro Ala Leu Phe 
45 275 280 285 

Asp Phe Phe Thr Trp Leu Gly Tyr He Asn Ser Leu He Asn Pro lie 
290 295 300 
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lie Tvr Thr Val Phe Asn Glu Glu Phe Arg Gin Ala Phe Gin Lys lie 
305 - 310 315 320 

Val Pro Phe Arg Lys Ala Ser 
325 

5 (2) INFORMATION FOR SEQ ID NO : 3 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Val lie Thr Ser Leu Leu Leu Gly Thr Leu lie Phe Cys Ala Val Leu 
15 10 15 

15 Gly Asn Ala Cys Val Val Ala Ala He Ala Leu Glu Arg Ser Leu Gin 

20 25 30 

Asn Val Ala Asn Tyr Leu He Gly Ser Leu Ala Val Arg Asp Leu Met 
35 40 45 

Val Ser Val Leu Val Leu Pro Met Ala Ala Leu Tyr Gin Val Leu Asn 
20 50 55 60 

Lys Trp Thr Leu Gly Gin Val Thr Cys Asp Leu Phe He Ala Leu Asp 
65 70 75 80 

Val Leu Cys Cys Thr Ser Ser He Leu His Leu Cys Ala He Ala Leu 
85 90 95 

2 5 Asp Arg Tyr Trp Ala He Thr Asp Pro He Asp Tyr Val As:i Lys Arg 

100 105 HC 

Thr Pro Arg Pro Arg Ala Leu He Ser Leu Thr Trp Leu He Gly Phe 
115 120 125 

Leu He Ser He Pro Pro Met Leu Gly Trp Arg Thr Pro Glu Asp Arg 
30 130 135 140 

Ser Asp Pro Asp Ala Cys Thr He Ser Lys Asp His Gly Tyr Thr He 
145 150 155 160 

Tyr Ser Thr He Phe Ala Phe Tyr lie Pro Leu Leu Leu Met Leu Val 
165 170 175 

3 5 Leu Tyr Gly Arg He Phe Arg Ala Ala Arg Phe Arg He Arg Lys Thr 

180 185 190 

Val Lys Lys Val Glu Lys Thr Gly Ala Asp Thr Arg His Gly Ala Ser 
195 200 205 

Pro Ala Pro Gin Pro Lys Lys Ser Val Asn Gly Glu Ser Gly Ser Arg 
40 210 215 220 

Asn Ala Ser Phe Glu Arg Lys Asn Glu Arg Asn Ala Phe Ala Lys Leu 

225 23C 235 240 



Phe Cys Glu Ser Ser Cys His Met Pre Thr Leu He Arg Ala He He 
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275 280 285 

Asn Trp Leu Cys Val lie Asn Ser Leu Leu Asn Pro Val He Tyr Ala 
290 295 300 

Tyr Phe Asn Lys Asp Phe Gin Asn Ala Phe Lys Lys He He Lys Cys 
5 305 310 315 320 

Asn Phe Cys Arg Gin 
325 

(2) INFORMATION FOR SEQ ID NO : 3 1 : 
(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 3 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
( D } TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gin Asn Trp Pro Ala Leu Ser He Val Val He He He Asn Thr He 
15 10 15 

Gly Gly Asn He Leu Val He Met Ala Val Ser Lys Lys Leu His Asn 
20 25 30 

2 0 Ala Thr Asn Tyr Phe Leu Met Ser He Ala He Ala Asp Me* Leu Val 

35 40 45 

Gly Phe Leu Val Trp Leu Ser Leu Leu Ala He Leu Tyr Asp Tyr Val 
50 55 60 

Trp Pro Leu Pro Arg Tyr Leu Cys Pro Val Trp He Ser Leu Asp Val 
25 65 70 75 80 

Leu Phe Ser Thr Ala Ser He Met His Leu Cys Ala He Ser Leu Asp 
85 90 95 

Arg Tyr Val Ala He Arg Asn Pro lie Glu His Ser Arg Phe Ser Arg 

100 105 110 

3 0 Thr Lys Ala He Met Lys He Ala He Val Trp Ala He Ser He Gly 

115 120 125 

Val Ser Val Pro He Pro Val He Gly Leu Arg Asp Glu Ser Lys Val 
130 135 140 

Phe Val Asn Asn Thr Thr He Cys Val Leu Asn Asp Pro Asn Phe Val 
35 145 150 155 160 

Leu He Gly Ser Phe Val Ala Phe Phe He Pro Thr Leu He Met Val 

165 170 175 

He Thr Tyr Phe Leu Thr He Tyr Val Leu Arg Arg Gin Th. Leu Met 
180 185 190 

40 Leu Leu Arg Gly His Thr Glu Glu Glu lie Ala Met Ser Leu Asn Phe 

195 200 205 

Leu Asn Cys Cys Cys Lys Lys Asn Gly Gly Glu Glu Glu Asn Ala Pro 
210 215 220 

Asn Asn Pro Asn Pro Asp Gin Lys Pro Arg Arg Lys Lys Lys Glu Lys 
45 225 230 235 240 

Arg Pro Arg Gly Thr Met Gin Ala He Asn Asn Glu Lys Lys Ala Ser 
245 250 255 
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Lys Val Leu Gly lie Val Phe Phe Val Phe Leu lie Met Trp Cys Pro 

260 265 270 

Phe Phe lie Thr ^sn lie Leu Ser Val Leu Cys Gly Lys Ala Cys Asn 
275 280 285 

5 Gin Cys Lys Leu Leu Asn Val Phe Val Trp lie Gly Tyr Val Cys Ser 

290 295 300 

Gly lie Asn Pro Val lie Tyr Thr Leu Phe Asn Lys lie Tyr Arg Arg 
305 310 315 320 

Ala Phe Ser Lys Tyr Leu Arg Cys Asp Tyr Lys Pro Asp Lys Lys Pro 
10 325 330 335 

Pro Val Arg Gin lie Pro Arg Val Ala Ala Thr Ala Leu Ser Gly Arg 
340 345 350 

Glu Leu Asn Val Asn lie Tyr Arg His Thr Asn Glu Arg Val Ala Arg 
355 360 365 

15 Lys Ala Asn Asp Pro Glu Pro Gly lie Glu Asn Gin Val Glu Asn Leu 

370 375 380 

Glu 
385 



(2) INFORMATION FOR SEQ ID NO: 32: 
2 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

2 5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Lys Asn Trp Ser Ala Leu Leu Thr Thr Val Val lie lie Leu Thr lie 
15 10 15 

Ala Gly Asn lie Leu Val lie Met Ala Val Ser Leu Glu Lys Lys Leu 
30 20 25 30 

Gin Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala lie Ala Asp Met 
35 40 45 

Leu Leu Gly Phe Leu Val Trp Val Ser Asn Glu Thr lie Leu Tyr Gly 

50 55 60 

3 5 Tyr Arg Trp Pro Leu Pro Ser Lys Leu Cys Ala He Trp He Tyr Leu 

65 70 75 80 

Asp Val Leu Phe Ser Thr Ala Ser lie Met His Leu Cys Ala lie Ser 
85 90 95 

Leu Asp Arg Tyr Val Ala He Gin Asn Pro He His His Ser Arg Phe 
40 100 105 110 

Asn Ser Arg Thr Lys Ala Phe Leu Lys He lie Ala Val Trp Thr lie 

] 1 s "20 125 
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Val Leu lie Gly Ser Phe Val Ala Phe Phe lie Pro Leu Thr lie Met 
165 170 175 

Val lie Thr Tyr Phe Leu Thr lie Lys Ser Leu Arg Gin Lys Phe Ala 

180 185 190 

5 Thr Leu Cys Val Ser Asp Leu Ser Thr Arg Ala Lys Leu Ala Ser Phe 

195 200 205 

Ser Phe Leu Pro Gin Ser Ser Leu Ser Ser Glu Lys Leu Phe Gin Arg 
210 215 220 

Ser lie His Arg Glu Pro Gly Ser Tyr Ala Gly Arg Lys Thr Met Gin 
10 225 230 235 240 

Ser lie Ser Asn Glu Gin Lys Ala Cys Lys Val Leu Gly lie Val Phe 
245 250 255 

Phe Leu Phe Val Val Met Trp Cys Pro Phe Phe lie Thr Asn lie Met 
260 265 270 

15 Val lie Cys Lys Glu Ser Cys Asn Glu Asn Val lie Gly Ala Leu Leu 

275 280 285 

Asn Val Phe Val Trp lie Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu 
290 295 300 

Val Tyr Thr Leu Phe Asn Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr 
20 305 310 315 320 

Leu Gin Cys Gin Tyr Lys Glu Asn Arg Lys Pro Leu Leu lie Leu Val 
325 330 335 

Asn Thr lie Pro Ala Leu Ala Tyr Lys Ser Ser Gin Leu Gin Val Gly 
340 345 350 

2 5 Gin Lys Lys Asn Ser Gin Glu Asp Ala Glu Gin Thr Val Asp Asp Cys 

355 360 365 

Ser Met Val Thr Leu Gly Lys Gin Gin Ser Glu 
370 375 

(2) INFORMATION FOR SEQ ID NO: 33: 

3 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

3 5 di) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

lie Thr lie Thr Val Val Leu Ala Val Leu lie Leu lie Thr Val Ala 
15 10 15 

Gly Asn Val Val Val Cys lie Ala Val Gly lie Asn Arg Arg Leu Arg 

40 20 25 30 

Asn Leu Thr Asn Cys Phe lie Val Ser Leu Ala lie Thr Asp Leu Leu 
35 40 45 

Leu Gly Leu Leu Val Leu Pro Phe Ser Ala lie Tyr Gin Leu Ser Cys 
50 55 60 

4 5 Lys Trp Ser Phe Gly Lys Val Phe Cys Asn lie Tyr Thr Ser Leu Asp 

65 70 75 80 

Val Met Leu Cys Thr Ala Ser lie Leu Asn Leu Leu lie Ser Leu Asp 
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85 90 95 

Arg Tyr Cys Ala Val Met Asp Pro Leu Arg Tyr Pro Val Leu Val Arg 
100 105 110 

Pro Val Arg Val Ala lie Ser Leu Val Leu lie Trp Val lie Ser lie 
5 115 120 125 

Thr Leu Ser Phe Leu Ser lie His Leu Gly Trp Asn Ser Arg Asn Glu 
130 135 140 

Thr Ser Lys Gly Asn His Thr Thr Ser Lys Cys Lys Val Gin Val Asn 
145 150 155 160 

10 Glu Val Tyr Gly Leu Val Asp Gly Leu Val Thr Phe Tyr Leu Pro Leu 

165 170 175 

Leu lie Met Cys lie Thr Tyr Tyr Arg lie Phe Lys Val Ala Arg Asp 
180 185 150 

Ala Lys Arg Asn Mis lie Ser Ser Trp Lys Ala Ala Thr lie Arg Glu 
15 195 200 205 

His Lys Ala Thr val Thr lie Ala Ala Val Met Ala Phe lie lie Cys 
210 215 220 

Trp Phe Pro Tyr Phe Thr Ala Phe Val Tyr Arg Gly Leu Arg Gly Asp 
225 230 235 240 

2 0 Asp Ala lie Asn Glu Val Leu Glu Ala lie Val Leu Trp Leu Gly Tyr 

245 250 255 

Ala Asn Ser Ala Leu Asn Pro lie Leu Tyr Ala Ala Leu Asn Arg Asp 
260 265 270 

Phe Arg Thr Gly Tyr Gin Gin Leu Phe Cys Cys Arg lie Ala Asn Arg 
25 275 280 285 

Asn Ser His Lys Thr Ser Leu Arg Ser Asn Ala Ser Gin Leu Ser Arg 
290 295 300 

Thr Gin Ser Arg Glu Pro Arg Gin Gin Glu Glu Lys Pro Leu Lys Leu 
305 310 315 320 

3 0 Gin Val Trp Ser Gly Thr Glu Val Thr Ala Pro Gin Gly Ala Thr Asp 

325 330 335 

Arg 



(2) INFORMATION FOR SEQ ID NO: 34: 

3 5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 315 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

4 0 (ii ) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 34 



Val -hr Thr lie Ser Tyr Leu Asn Leu Ala Val Ala Asp Phe Cys Phe 
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35 40 45 

Thr Ser Thr Leu Pro Phe Phe Met Val Arg Leu Gly His Trp Pro Phe 
50 55 60 

Gly Trp Phe Leu Cys Lys Phe Leu Phe Thr lie Val Asp lie Asn Leu 
5 65 70 75 80 

Phe Gly Ser Val Phe Leu lie Ala Leu lie Ala Leu Asp Arg Cys Val 
85 90 95 

Cys Val Leu His Pro Val Trp Thr Gin Asn His Arg Thr Val Ser Leu 
100 105 110 

10 Ala Lys Lys Val lie lie Gly Pro Trp Val Met Ala Leu Leu Leu Thr 

115 120 125 

Leu Pro Val lie lie Arg Val Thr lie Val Pro Gly Lys Thr Gly Thr 
130 135 140 

Val Ala Cys Thr Phe Asn Phe Ser Pro Trp Thr Asn Asp Pro Lys Glu 
15 145 150 155 160 

Arg lie Asn Val Ala Val Ala Met Leu Thr Val Arg Gly lie He Arg 
165 170 175 

Phe He He Gly Phe Ser Ala Pro Met Ser He Val Ala Val Ser Tyr 
180 185 190 

2 0 Gly Leu He Ala Thr Lys He He Lys Ser Ser Arg Pro Leu Ara Val 

195 200 205 

Leu Ser Phe Val Ala Ala Ala Phe Phe Leu Cys Trp Ser Pro Tyr Gin 
210 215 220 

Val Val Ala Leu He Ala Thr Val Arg He Arg Glu Leu Leu Gin Gly 
25 225 230 235 240 

Met Tyr Lys Glu He Gly He Ala Val Asp Val Thr Ser Ala He Ala 
245 250 255 

Phe Phe Asn Ser Cys Leu Asn Pro Leu Tyr Val Phe Met Gly Gin Asp 
260 265 270 

3 0 Phe Arg Glu Arg Leu He His Ala Leu Pro Ala Ser Leu Glu Arg Ala 

275 280 285 

Leu Thr Glu Asp Ser Thr Gin Thr Ser Asp Thr Ala Thr Asn Ser Thr 
290 295 300 

Leu Pro Ser Ala Glu Val Ala Leu Gin Ala Lys 
35 305 310 315 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 

Asp He Leu Ala Leu Val lie Phe Ala Val Val Phe Leu Va 1 Gly Val 
45 l 5 10 15 

Leu Gly Asn Ala Leu Val Val Trp Val Thr Ala Phe Glu Ala Lys Arg 
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20 25 30 

Thr lie Asn Ala He Trp Phe Leu Asn He Ala Val Ala Asp Phe Leu 
35 40 45 

Ser Cys Leu Ala Leu Pro He Leu Phe Thr Ser He Val Gin His His 
5 50 55 60 

His Trp Pro Phe Gly Gly Ala Ala Cys Ser He Leu Pro Ser Leu He 
65 70 75 80 

Leu Leu Asn Met Tyr Ala Ser He Leu Leu Leu Ala Thr He Ser Ala 
85 90 95 

10 Asp Arg Phe Leu Leu Val Phe Lys Pro He Trp Cys Gin Asn Phe Arg 

100 105 110 

Gly Ala Gly Leu Ala Trp He Ala Cys Ala Val Ala Trp Gly He Ala 
115 120 125 

Leu Leu Leu Thr lie Pro Ser Phe Leu Tyr Arg Val Val Arg Glu Glu 
15 130 135 140 

Tyr Phe Pro Pro Lys Val Leu uys Gly Cys Asp Tyr Set Kib Asp Lys 
145 150 155 160 

Arg Arg Glu Arg Ala Val Ala He Val Arg Leu Val Leu Gly Phe Leu 
165 170 175 

2 0 Trp Pro Leu Leu Thr Leu Thr He Cys Tyr Thr Thr Arg Ser Thr Lys 

180 185 190 

Thr Leu Lys Val Val Val Ala Val Val Ala Ser Phe Phe He Phe Trp 
195 200 205 

Leu Pro Tyr Gin Val Thr Gly He Met Met Ser Phe Leu Glu Pro Ser 
25 210 215 220 

Ser Pro Thr Phe Leu Leu Leu Asn Lys Leu Asp Ser Leu Cys Val Ser 
225 230 235 240 

Phe Ala Tyr He Asn Cys Cys He Asn Pro He He Tyr Val Val Ala 
245 250 255 

3 0 Gly Gin Gly Gin Phe Gin Gly Arg Leu Arg Lys Ser Leu Pro Ser Leu 

260 265 270 

Leu Arg Asn Val Leu Thr Glu Glu Ser Val Val Arg Glu Ser Lys Ser 
275 280 285 

Phe Thr Arg Ser Thr Val Asp Thr Met Ala Gin Lys Thr Gin Ala Val 
35 290 295 300 

(2) INFORMATION FOR SEQ ID NO : 3 6 : 
ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



Pro Leu Asn lie Met Ala lie Val Val Phe He Leu Lys Met Lys Val 
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20 25 30 

Lys Lys Pro Ala Val His lie Ala Thr Ala Asp Val Leu Phe Val Ser 
35 40 45 

Val Leu Pro Phe Lys lie Ser Tyr Tyr Phe Ser Gly Ser Asp Trp Gin 
5 50 55 60 

Phe Gly Ser Glu Leu Cys Arg Phe Val Thr Ala Ala Phe Tyr Cys Asn 
65 70 75 80 

Met Tyr Ala Ser lie Leu Leu lie Ser lie Asp Arg Phe lie Ala Val 

85 90 95 

10 Val Tyr Pro Met Gin Ser Leu Ser Trp Arg Thr Leu Gly Arg Ala Ser 

100 105 110 

Phe Thr Cys lie Ala lie Trp Ala lie Ala He Ala Gly Val Pro Leu 
115 120 125 

Val Leu Lys Glu Gin Thr He Gin Val Pro Gly Leu Asn He Thr Thr 
15 130 135 140 

He Cys His Asp Val Leu Asn Glu Thr Leu Leu Glu Gly Tyr Tyr Ala 
145 150 155 160 

Tyr Tyr Phe Ser Ala Phe Ser Ala Val Phe Phe Phe Val Pro Leu He 

165 170 175 

2 0 He Ser Thr Val Cys Tyr Val Ser He He Arg Cys Leu Ser Ser Ser 

180 185 190 

Ala Val Ala Asn Arg Ser Lys Lys Ser Arg Thr Asn Arg Cys Phe Asn 
195 200 205 

Ser Thr Val Ala Leu Phe Leu Ser Ala Ala Val Phe Cys He Phe He 
25 210 215 220 

He Cys Phe Gly Pro Thr Trp Leu Leu He Ala His Tyr Ser Phe Leu 
225 230 235 240 

Ser His Thr Ser Thr Thr Glu Ala Ala Tyr Phe Ala Tyr Leu Leu Cys 
245 250 255 

3 0 Val Cys Val Ser Ser He Ser Ser Cys He Asp Pro Leu He Tyr Tyr 

260 265 270 

Tyr Ala Ser Ser Glu Cys Gin Arg Tyr Val Tyr Ser He Leu Cys Cys 
275 280 285 

Lys Glu Ser Ser Asp Pro Ser Ser Tyr Asn Ser Ser Gly Gin Leu Met 

35 290 295 300 

Ser Leu Thr Cys Ser Ser Asn Leu Asn Asn Ser He Tyr Lys Lys Leu 
305 310 315 320 

Leu Thr 

4 0 (2) INFORMATION FOR SEQ ID NO : 3 7 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
4 5 (D) TOPOLOGY: linear 

(n) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 7 : 

Tyr lie Asn Thr Val lie Ser Cys Thr He Phe He Val Gly Trp Gly 

15 10 15 

Asn Ala Thr Leu Leu Arg He He Tyr Gin Asn Lys Cys Met Arg Asn 

5 20 25 30 

Gly Pro Asn Ala Leu He Ala Ser He Ala Leu Gly Asp Leu lie Tyr 
35 40 45 

Val Val He Asp Leu Pro lie Asn Val Pro Lys Leu He Ala Gly Arg 

50 55 60 

10 Trp Pro Phe Glu Gin Asn Asp Phe Gly Val Phe Cys Lys Phe Met Gly 

65 70 75 80 

Val Val Met lie Phe Phe Gly Leu Ser Pro Leu Leu Leu Gly Ala Ala 
85 90 95 

Met Ala Ser Glu Arg Tyr Leu Gly He Thr Arg Pro Phe Ser Arg Pro 
15 100 105 110 

Ala Val Ala Ser Gin Arg Arg Ala Trp Ala Thr Val Gly Leu Val Trp 
115 120 125 

Ala Ala Ala Leu Ala Leu Gly Leu Leu Pro Leu Leu Gly Val Gly Arg 
130 135 140 

2 0 Tyr Thr Val Gin Tyr Pro Gly Ser Trp Cys Phe Leu Thr Leu Gly Ala 

145 150 155 160 

Glu Ser Gly Asp Val Ala Phe Gly Leu Leu Phe Ser Gly Leu Ser Val 
165 170 175 

Gly Leu Ser Phe Leu Leu Asn Thr Val Ser Val Ala Thr Leu His His 
25 180 185 190 

Val Tyr His Gly Gin Glu Ala Ala Gin Gin Arg Pro Arg Asp Ser Glu 
195 200 205 

Val Glu Met Met Ala Gin Leu Leu Gly He Met Val Val Ala Ser Val 
210 215 220 

3 0 Cys Trp Leu Pro Leu Leu Val Phe He Ala Gin Thr Val Leu Arg Asn 

225 230 235 240 

Pro Pro Ala Met Ser Pro Ala Gly Gin Leu Ser Arg Thr Thr Glu Lys 

245 250 255 

Glu Leu Leu He Tyr Leu Arg Val Ala Thr Trp Asn Gin He Leu Asp 

35 260 265 270 

Pro Trp Val Tyr He Leu Phe Arg Arg Ala Val Leu Arg Arg Leu Gin 
275 280 285 

Pro Arg Leu Ser Thr Arg Pro Arg Ser Leu Ser Leu Gin Pro Gin Leu 
290 295 30C 

4 0 Thr Gin Arg Ser Gly Leu Gin 

3 OS 3 10 



iC? STRANDEDNESS : sing j 
( D } TOPOLOGY: linear 
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:ii) MOLECULE TYPE: peptide 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 

Lys Tyr Phe Val Val lie lie Tyr Ala Leu Val Phe Leu Leu Ser Leu 

15 10 15 

5 Leu Gly Asn Ser Leu Val Met Leu Val lie Leu Tyr Ser Arg Gly Val 

20 25 30 

Arg Ser Val Thr lie Val Tyr Leu Leu Asn lie Ala lie Ala Asp Leu 
35 40 45 

Leu Phe Ala Leu Thr Leu Pro lie Trp Ala Ala Ser Lys Val Asn Gly 
10 50 55 60 

Trp lie Phe Gly Thr Phe Leu Cys Lys Trp Ser Leu Leu Lys Glu Val 

65 70 75 80 

Asn Phe Tyr Ser Gly lie Leu Leu Leu Ala Cys lie Ser Val Asp Arg 
85 90 95 

15 Tyr Leu Ala lie Val Arg Ala Thr Arg Thr Leu Thr Gin Lys Arg His 

100 105 llu 

Leu Val Lys Phe lie Cys Leu Ser lie Trp Gly Leu Ser Leu Leu Leu 
115 120 125 

Ala Leu Pro Val Leu Leu Phe Arg Arg Thr Val Tyr Ser Ser Asn Val 
20 130 135 140 

Ser Pro Ala Cys Tyr Glu Asp Met Gly Asn Asn Tyr Ala Asn Trp Arg 
145 150 155 160 

Met Leu Leu Pro lie Leu Pro Gin Ser Phe Gly Phe lie Val Pro Leu 
165 170 175 

2 5 Leu lie Met Leu Tyr Cys Tyr Gly Phe Thr Leu Arg Thr Leu Phe Lys 

180 185 190 

Ala lie Met Gly Gin Lys His Arg Ala Met Arg Val lie Phe Ala Val 
195 200 205 

Val Leu lie Phe Leu Leu Cys Trp Leu Pro Tyr Asn Leu Val Leu lie 

30 210 215 220 

Ala Asp Thr Leu Met Arg Thr Gin Val lie Gin Glu Thr Cys Glu Arg 
225 230 235 240 

Arg Asn His lie Asp Arg Ala lie Asp Ala Thr Glu lie Leu Gly lie 
245 250 255 

3 5 Leu His Ser Cys Leu Asn Pro Leu lie Tyr Ala Phe lie Gly Gin Lys 

260 265 270 

Phe Arg His Gly Leu Leu Lys lie Leu Ala lie His Gly Leu lie Ser 
275 280 285 

Lys Asp Ser Leu Pro Lys Asp Ser Arg Pro Ser Phe Val Gly Ser Ser 
40 290 295 300 

Ser Gly His Thr Ser Thr Thr Leu 
305 310 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS : 

4 5 (A) LENGTH: 32 6 ammo acids 

(B) TYPE: amino acid 



BNSDOCID: <WO_..9405695A1_I_> 



WO 94/05695 W W PCT/US93/08528 

- 93 - 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(u) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

5 Leu Phe Pro lie Val Tyr Ser lie lie Phe Val Leu Gly lie lie Ala 

15 10 15 

Asn Gly Tyr Val Leu Trp Val Phe Ala Arg Leu Tyr Pro Ser Lys Lys 

20 25 30 

Asn Glu lie Lys lie Phe Met Val Asn Leu Thr Val Ala Asp Leu Leu 
10 35 40 45 

Phe Leu He Thr Leu Pro Leu Trp He Val Tyr Tyr Ser Asn Gin Gly 
50 55 60 

Asn Trp Phe Leu Pro Lys Phe Leu Cys Asn Leu Ala Gly Cys Leu Phe 
65 70 75 80 

15 Phe He Asn Thr Tyr Cys Ser Val Ala Phe Leu Gly Val He Thr Tyr 

85 90 95 

Asn Arg Phe Gin Ala Val Lys Tyr Pro He Lys Thr Ala Gin Ala Thr 
100 105 HO 

Thr Arg Lys Arg Gly He Ala Leu Ser Leu Val He Trp Val Ala He 
20 115 120 125 

Val Ala Ala Ala Ser Tyr Phe Leu Val Met Met Asp Ser Thr Asn Val 
130 135 140 

Val Ser Asn Lys Ala Gly Ser Gly Asn He Thr Arg Cys Phe Glu Arg 
145 150 155 160 

2 5 Tyr Glu Lys Gly Ser Lys Pro Val Leu He He His He Cys He Val 

165 170 175 

Leu Gly Phe Phe He Val Phe Leu Leu He Leu Phe Cys Asn Leu Val 
180 185 190 

He He His Thr Leu Leu Arg Gly Pro Val Lys Gin Gin Arg Asn Ala 

30 195 200 205 

Glu Val Arg Arg Arg Ala Leu Trp Met Val Cys Thr Val He Ala Val 
210 215 220 

Phe Val He Cys Phe Val Pro His His Met Val Gin Leu Pro Trp Thr 
225 230 235 240 

3 5 Leu Ala Glu Leu Gly Met Trp Pro Ser Ser Asn His Gin Ala He Asn 

245 250 255 

Asp Ala His Gin Val Thr Leu Cys Leu Leu Ser Thr Asn Cys Val Leu 
260 265 270 

Asp Pro Val He Tyr Cys Phe Leu Thr Lys Lys Phe Arg Lys His Leu 

40 275 280 285 



Val Asn Pre He Lys Asn 
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325 

(2) INFORMATION FOR SEQ ID NO: 40: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 333 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
(li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 0 : 
10 Tyr lie Asn Thr lie Val Ser Cys Leu Val Phe Val Leu Gly lie lie 

15 10 15 

Gly Asn Ser Thr Leu Leu Arg lie lie Tyr Lys Asn Lys Cys Met Arg 

20 25 30 

Asn Gly Pro Asn lie Leu lie Ala Ser lie Ala Leu Gly Asp Leu Leu 
IB 35 40 45 

His lie lie lie Asp lie Pro lie Met Ala Tyr Lys Leu lie Ala Gly 
50 55 60 

Asp Trp Pro Phe Ala Cys Lys Leu Phe Pro Phe Leu Gin Lys Ser Ser 

65 70 75 80 

2 0 Val Gly lie Thr Val Leu Asn Leu Cys Ala Leu Ser Val Asp Arg Tyr 

85 90 95 

Arg Ala Val Ala Ser Trp Ser Arg Val Gin Gly lie Gly lie Pro Leu 
100 105 110 

Val Thr Ala lie Glu lie Val Ser lie Trp lie Leu Ser Phe lie Leu 
25 115 120 125 

Ala lie Pro Glu Ala lie Gly Phe Trp Met Val Pro Phe Glu Tyr Lys 
130 135 140 

Gly Ala Gin His Arg Thr Cys Met Leu Asn Ala Thr Ser Lys Leu Phe 
145 150 155 160 

3 0 Tyr Gin Asp Val Lys Asp Trp Trp Leu Phe Gly Phe Tyr Phe Leu Leu 

165 170 175 

Val Cys Thr Ala lie Phe Tyr Thr Leu Met Thr Cys Glu Met Leu Asn 
180 185 190 

Arg Arg Asn Gly Ser Leu Arg lie Ala Leu Ser Glu His Leu Lys Gin 
35 195 200 205 

Arg Arg Glu Val Ala Lys Thr Val Phe Cys Leu Val Val lie Phe Ala 
210 215 220 

Leu Cys Trp Phe Pro Leu His Leu Ser Arg lie Leu Lys Lys Thr Val 
225 230 235 240 

4 0 Tyr Asp Glu Met Asp Thr Asn Arg Cys Glu Leu Leu Ser Phe Leu Leu 

245 250 255 

Leu Met Tyr lie Gly lie Asn Thr Ala Thr Met Ser Cys lie Asn Pro 

260 265 270 

lie Ala Leu Tyr Phe Val Ser Lys Lys Phe Lys Asn Cys Phe Gin Ser 
45 275 280 285 

Cys Leu Cys Cys Cys Cys Tyr Gin Ser Lys Ser He Met Thr Ser Val 
290 295 300 
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Pro Met Gin Gly Thr Ser He Gin Trp Lys Asn His Glu Gin Asn Asn 

305 310 315 320 

His Asn Thr Glu Arg Ser Ser His Lys Asp Ser He Asn 
325 330 

5 (2) INFORMATION FOR SEQ ID NO : 4 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Leu He Ala Ser Pro Trp Phe Ala Ala Ser Phe Cys Val Val Gly Leu 

15 10 15 

15 Ala Ser Asn Leu Leu Ala Leu Ser Val Leu Ala Gly Ala Arg Gin Ser 

20 25 30 

Ser Ser His Thr Arg Ser Ser Phe Leu Thr Phe Leu Cys Gly Leu Val 
35 40 45 

Leu Thr Leu Asp Phe Leu Gly Leu Leu Val Thr Gly Thr He Val Val 
20 50 55 60 

Ser Gin His Ala Ala Leu Phe Glu Trp His Ala Val Asp Pro Gly Cys 
65 70 75 80 

Arg Leu Cys Arg Leu Val Pro Phe He Gin Lys Ala Ser Val Gly He 
85 90 95 

25 Thr Val Leu Ser Leu Cys Ala Leu Ser He Asp Arg Tyr Arg Ala Val 

100 105 110 

Ala Ser Trp Ser Arg He Lys Gly He Gly Val Pro Lys Trp Thr Ala 
115 120 125 

Val Glu He Val Leu He Trp Val Val Ser Val Val Leu Ala Val Pro 
30 130 135 140 

Glu Ala He Gly Phe Asp Thr Thr Ser Asp Tyr Lys Gly Lys Pro Leu 
145 150 155 160 

Arq Val Cys Met Leu Asn Pro Phe Gin Lys Thr Ala Phe Met Phe Tyr 
165 170 175 

3 5 Lys Thr Ala Ala Lys Asp Trp Trp Leu Phe Ala Phe Tyr Phe Cys Leu 

180 185 190 

Pro Leu Ala He Thr Ala He Phe Tyr Thr Leu Met Thr Cys Glu Met 
195 200 205 

Leu Arg Lys Lys Ser Gly Met Gin He Ala Leu Asn Asp His Leu Lys 
40 210 215 220 

Gin Arg Arg Glu Val Ala Lys Thr Val Phe Cys Leu Val Leu Val Phe 

225 230 235 240 



Leu Val Leu Asp Tyr He Gly He Asn Met Ala Ser lie Asn Ser Cys 
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275 280 285 

He Asn Pro He Ala Leu Tyr Leu Val Ser Lys Arg Phe Lys Asn Cys 
290 295 300 

Phe Lys Ser Cys Leu Cys Cys Trp Cys Gin Thr Phe Glu Glu Lys Gin 

5 305 310 315 320 

Ser Leu Glu Glu Lys Gin Ser Cys Leu Lys Phe Lys Ala Asn Asp His 
325 330 335 

Gly Tyr Asp Asn Phe Arg Ser Ser Asn Lys Tyr Ser Ser Ser 
340 345 350 

10 (2) INFORMATION FOR SEQ ID NO: 42: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 2 : 

lie Tyr Val He Pro Ala Val Tyr Gly Leu He He Val He Gly Leu 
15 10 15 

20 He Gly Asn He Thr Leu He Lys lie Phe Cys Thr Val Lyj Ser Leu 

20 25 30 

Asn Leu Phe He Ser Ser He Ala Leu Gly Asp Leu Leu Leu Leu Val 
35 40 45 

Thr He Cys Ala Pro Val Asp Ala Ser Lys Tyr He Ala Asp Arg Trp 
25 50 55 60 

Leu Phe Gly Arg He Gly Cys Lys Leu He Pro Phe He Gin Leu Thr 
65 70 75 80 

Ser Val Gly Val Ser Val Phe Thr Leu Thr Ala Leu Ser Ala Asp Arg 
85 90 95 

3 0 Tyr Lys Ala He Val Arg Pro Thr Cys lie Gin Ala Ser Leu He Cys 

100 105 110 

Leu Lys Ala Ala Leu He Trp He Val Ser Leu Leu Ala lie Pro Glu 
115 120 125 

Ala Val Phe Ser Asp Leu His Pro Phe His Val Lys Asp Thr Asn Gin 

35 130 135 140 

Thr Phe lie Ser Cys Ala Pro Tyr Pro His Ser Asn Glu Leu His Pro 
145 150 155 160 

Lys He His Ser Met Ala Ser Phe Leu Val Phe Tyr Val He Pro Leu 
165 170 175 

4 0 Ala He He Ser Val Tyr Tyr Tyr Phe lie Ala Arg Asn Leu He Gin 

180 185 190 

Ser Ala Tyr Asn Leu Pro Val Glu Gly Asn He His Val Lys Lys Gin 
195 200 205 

lie Glu Ser Arg Lys Arg Leu Ala Lys Thr Val Leu Val Ph^* Val Gly 
45 210 215 220 

Leu Phe Ala Phe Cys Trp Leu Pro Asn His Val He Tyr Leu Tyr Arg 

225 230 235 240 
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Ser Tyr His Tyr Ser Glu Val Asp Thr Ser Met Leu His Phe Val Thr 
245 250 255 

Ser lie Cys Ala Arg Leu Leu Ala Pro Thr Asn Ser Cys Val Asn Pro 
260 265 270 

5 Phe Ala Leu Tyr Leu Leu Ser Lys Ser Phe Arg Gin Phe Asn Thr Gin 

275 280 285 

Leu Leu Cys Cys Gin Pro Gly Leu Ser His Ser Thr Gly Arg Ser Leu 
290 295 300 

Ser Phe Lys Ser Thr Asn Pro Ser Ala Thr Phe Ser Leu lie Asn Arg 
10 305 310 315 320 

Asn lie Cys His Glu Gly Tyr Val 
325 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 34 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOiiOGY : i inear 
(ii) MOLECULE TYPE: peptide 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Cys Val lie Pro Ser Ser Leu Tyr Leu lie lie lie Ser Val Gly Leu 
15 10 15 

Leu Gly Asn lie Met Leu Val Lys lie Phe Leu Thr Asn Ser Thr Met 

20 25 30 

25 Arg Ser Val Pro Asn lie Phe lie Ser Asn lie Ala Ala G1-* Asp Leu 

35 40 45 

Leu Leu Leu Leu Thr Cys Val Pro Val Asp Ala Ser Arg Tyr Phe Phe 
50 55 60 

Asp Glu Trp Val Phe Gly Lys Leu He Gly Cys Lys Leu He Pro Ala 
30 65 70 75 80 

He Gin Leu Thr Ser Val Gly Val Ser Val Pro Thr Leu Thr Ala Leu 

85 90 95 

Ser Ala Asp Arg Tyr Arg Ala He Val Asn Pro Met Asp Met Thr Ser 
100 105 110 

3 5 Gly Val Val Leu Trp Thr Ser Val Ala Val Gly He Trp Val Val Ser 

115 120 125 

Val Leu Leu Ala Val Pro Glu Ala Val Phe Ser Glu Val Ala Arg He 
130 135 140 

Gly Ser Ser Asp Asn Ser Ser Phe Thr Ala Cys He Pro Tyr Pro Gin 
40 145 150 155 160 

Thr Asp Glu Leu His Pro Lvs He His Ser Val Leu He Phe Leu Val 

165 " 170 175 



Glu 



His Thr Lys Lys Gin Met Glu Thr Arg Lys Arg Leu Ala Lys He 
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210 215 220 

Val Leu Val Phe Val Gly Cys Phe Val Phe Cys Trp Phe Pro Asn His 
225 230 235 240 

lie Leu Tyr Leu Tyr Arg Ser Phe Asn Tyr Lys Glu lie Asp Pro Ser 
5 245 250 255 

Leu Gly Thr Cvs Val Thr Leu Val Ala Arg Val Leu Ser Phe Ser Asn 
260 265 270 

Ser Cys Val Asn Pro Phe Ala Leu Tyr Leu Leu Ser Glu Ser Phe Arg 
275 280 285 

10 Lys His Phe Ser Asn Gin Leu Cys Cys Gly Gin Lys Ser Tyr Pro Glu 

290 295 300 

Arg Ser Thr Ser Tyr Leu Leu Ser Ser Ser Ala Val Trp Arg Ser Leu 
305 310 315 320 

Lys Ser Asn Ala Lys Asn Val Val Thr Asn Ser Val Leu lie Asn Gly 
15 325 330 335 

His Ser Thr Lys Gin Glu lie Ala Leu 
340 345 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Tyr Thr Leu Ser Phe lie Tyr lie Phe lie Phe Val lie Cys Glx Leu 
15 10 15 

Leu Ala Asn Ser Val Val Val Trp Val Asn He Gin Ala Lys Thr Thr 

20 25 30 

3 0 Gly Tyr Asp Thr His Cys Tyr He Leu Asn Leu Ala He Ala Asp Leu 

35 40 45 

Trp Trp Leu Thr He Pro Val Trp Trp Ser Leu Val Gin His Asn Gin 
50 55 60 

Trp Pro Met Gly Glu Leu Thr Cys Lys Val Thr His Leu He Phe Ser 
35 65 70 75 80 

He Asn Leu Phe Ser Gly He Phe Phe Leu Thr Cys Met Ser Val Asp 

85 90 95 

Arg Tyr Leu Ser He Thr Tyr Phe Thr Asn Thr Pro Ser Sei Arg Lys 
100 105 110 

4 0 Lys Met Val Arg Arg Ala Val Cys He Leu Val Trp Leu Leu Ala Phe 

115 120 125 

Cys Val Ser Leu Pro Asp Thr Tyr Tyr Leu Lys Thr Val Thr Ser Ala 
130 135 140 

Ser Asn Asn Glu Thr Tyr Cys Arg Ser Phe Tyr Pro Glu His Ser He 
45 145 150 155 160 

Lys Glu Trp Leu lie Ser Leu Leu Val Ser Val Val Leu He Gly Phe 
165 170 175 
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Ala Val Pro Phe Ser lie lie Ala Val Phe Tyr Phe Ser Leu lie Ala 
180 185 190 

Arg Ala lie Ser Ala Ser Ser Asp Gin Glu Lys His Ser Ser Arg Lys 
195 200 205 

5 lie lie Phe Ser Tyr Val Val Val Phe Leu Val Cys Trp Leu Pro Tyr 

210 215 220 

His Val Ala Val Leu Leu Asp lie Phe Ser lie Leu His Tyr lie Pro 
225 230 235 240 

Phe Thr Cys Arg Leu Glu His Ala Leu Phe Thr Ala Leu His Val Thr 
10 245 250 255 

Gin Cys Leu Ser Leu Val His Cys Cys Val Asn Pro Val Leu Tyr Ser 
260 265 270 

Phe lie Asn Arg Asn Tyr Arg Tyr Glu lie Asn Trp lie Phe Lys Tyr 
275 280 285 

15 Ser Ala Lys Thr Gly Leu Thr Lys Leu lie Asp Ala Ser Arg Val Ser 

290 295 300 

Glx Thr Glu Tyr Ser Ala Leu Glu Gin Asn Ala Lys 
305 310 315 

(2) INFORMATION FOR SEQ ID NO:45: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Lys Val Leu Val Thr Ala lie Tyr Leu Ala Leu Phe Val Va: Gly Thr 
15 10 15 

Val Gly Asn Ser Val Thr Ala Phe Thr Leu Ala Arg Lys Lys Ser Leu 
30 20 25 30 

Gin Ser Leu Gin Ser Thr Val His Tyr His Leu Ser Ser Leu Ala Leu 
35 40 45 

Ser Asp Leu Leu lie Leu Leu Trp Val Glu Leu Tyr Asn Phe lie Trp 

50 55 60 

3 5 His His Pro Trp Ala Phe Gly Asp Ala Gly Cys Arg Gly Tyr Tyr Phe 

65 70 75 80 

Leu Arg Asp Ala Cys Thr Tyr Ala Thr Ala Leu Asn Val Ala Ser Leu 

85 90 95 

Ser Val Glu Arg Tyr Leu Ala He Cys His Pro Phe Lys Ala Lys Thr 
40 100 105 110 

Leu Met Ser Arg Ser Arg Thr Lys Lys Phe lie Ser Ala He Trp Leu 

115 120 125 



He Val Asp Thr Ala Thr Val Lys Val Val He Gin Val Asn Thr Phe 
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165 170 175 

Met Ser Phe Leu Phe Pro Met Leu Val lie Ser lie Leu Asn Thr Val 
180 185 190 

lie Ala Asn Lys Leu Thr Val Met Val His Gin Ala Ala Glu Gin Gly 
5 195 200 205 

Arg Val Cys Thr Val Gly Thr His Asn Gly Leu Glu His Ser Thr Phe 
210 215 220 

Asn Met Arg lie Glu Pro Gly Arg Val Gin Ala Leu Arg His Gly Val 
225 230 235 240 

10 Leu Val Leu Arg Ala Val Val lie Ala Phe Val Val Cys Trp Leu Pro 

245 250 255 

Tyr Leu Cys Tyr lie Ser Asp Glu Gin Trp Arg Thr Phe Leu Phe Asp 
260 265 270 

Phe Tyr His Tyr Phe Tyr Met Leu Thr Asn Ala Leu Phe Tyr Val Ser 
15 275 280 285 

Ser Ala lie Asn Pro lie Leu Tyr Asn Leu Val Ser Ala Asn Phe Arg 
290 295 300 

Gin Val Phe Leu Ser Thr Leu Ala Cys Leu Phe Cys Pro Gly Trp Pro 
305 310 315 320 

2 0 Leu lie Arg Arg Lys Lys Arq Pro Thr Phe Ser Arg Lys Pro Asn Ser 

325 ~ 330 335 

Met Ser Ser Asn His Ala Phe Ser Thr Ser Ala Thr Arg Phe Thr Leu 
340 345 350 



25 



Tyr 



(2) INFORMATION FOR SEQ ID NO : 4 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Ala lie Gin Ala Pro Phe Leu Trp Val Leu Phe Leu Leu Ala Ala Leu 

35 1 5 10 15 

Glu Asn lie Phe Val Leu Ser Val Phe Cys Leu His Lys Thr Asn Cys 

20 25 30 

Thr Val Ala Glu lie Tyr Leu Gly Asn lie Ala Ser Ala Asp Leu lie 
35 40 45 

4 0 lie Ala Cys Gly Leu Pro Phe Trp Ala lie Thr lie Ala Asn Asn Phe 

50 55 60 

Asp Trp Leu Phe Gly Glu Val Leu Cys Arg Val Val Asn Leu Tyr Met 

65 70 75 80 

Asn Leu Tyr Ser Ser lie Cys Phe Leu Val Ser lie Asp Arq Tyr Leu 
45 85 90 95 

Ala Leu Val Lys Thr Met Ser Asn Leu Arg Trp Ala Lys Leu Tyr Ser 
100 105 110 
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Leu Val lie Trp Ser Cys Thr Leu Leu Leu Ser Ser Pro Met Leu Val 

115 120 125 

Phe Arg Thr Met Tyr Arg Glu Glu Gly His Asn Val Thr Cyd Val lie 
130 135 140 

5 Val Tyr Pro Ser Arg Ser Trp Glu Val Phe Leu Leu Asn Leu Val Gly 

145 150 155 160 

Phe Leu Leu Pro Leu Ser lie lie Thr Phe Cys Thr Val Arg lie Met 
165 170 175 

Val Leu Arg Asn Asn Glu Met Lys Lys Phe Lys Glu Val Gin Thr Glu 
10 180 185 190 

Lys Lys Ala Thr Val Leu Val lie Ala Val Leu Gly Leu Phe Val Leu 
195 200 205 

Cys Trp Phe Pro Phe Gin lie Ser Thr Phe Leu Asp Thr Leu Leu Arg 
210 215 220 

15 Leu Gly Val Leu Ser Gly Cys Trp Asn Glu Arq Ala Val Asp lie Val 

225 230 235 240 

Arg Gin lie Ser Ser Tyr Val Ala Tyr Ser Asn Ser Cys Leu Asn Pro 
245 250 255 

Leu Val Tyr Val lie Val Gly Lys Arg Phe Arg Lys Lys Ser Arg Glu 
20 260 265 270 

Val Tyr Gin Ala lie Cys Arg Lys Gly Gly Cys Met Gly Glu Ser Val 
275 280 285 

Leu Asn Ser Met Gly Thr Leu Arg Thr Ser lie Ser Val Asp Arg Gin 
290 295 300 

2 5 lie His Lys Leu Gin Asp Trp Ala Gly Asn Lys Gin 

305 310 315 

(2) INFORMATION FOR SEQ ID NO : 4 7 : 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 7 amino acids 

3 0 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 : 
3 5 He Leu Leu Val Val He He Cys Gly Leu Gly He Val Gl> Asn He 

1 r . 10 15 

Met Val Val Leu Val Val Met Arg Thr Thr Pro Thr Asn Cys Tyr Leu 
20 25 30 

Val Ser He Ala Val Ala Asp Leu Met Val Leu Val Ala Ala Gly Leu 

40 35 40 45 

Pro Asn He Thr Asp Ser He Tyr Gly Ser Trp Val Tyr Gly Tyr Val 



His 



Pre He Lys Ala Gin Phe Leu Cvs Thr 



Phe Ser Ara Ala Lys 



Lvs 
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100 105 110 

lie lie lie Phe Val Trp Ala Phe Thr Ser lie Tyr Leu Phe Leu Leu 
115 120 125 

Asp lie Asn lie Ser Thr Tyr Lys Asn Ala Val Val Val Ser Cys Gly 
5 130 135 " 140 

Tyr Lys lie Ser Arg Asn Tyr Tyr Ser Pro lie Tyr Leu Met Asp Phe 
145 150 155 160 

Gly Val Phe Tyr Val Val Pro Leu lie Ala Thr Val Leu Tyr Gly Phe 
165 170 175 

10 lie Ala Arg lie Leu Phe Leu Asn Pro lie Pro Ser Asp Pro Lys Glu 

180 185 190 

Asn Ser Lys Met Trp Lys Asn Asp Ser lie His Gin Asn Lys Asn Leu 
195 200 205 

Asn Leu Asn Ala Ser Ser Arg Lys Gin Val Thr lie Asn Leu Ala Val 
15 210 215 220 

Val Val lie Leu Phe Ala Leu Leu Trp Asn Thr Tyr Arg Thr Leu Val 
225 230 235 240 

Val Val Asn Ser Phe Leu Ser Ser Pro Phe Gin Glu Asn Trp Lys Leu 
245 250 255 

2 0 Leu Lys Cys Arg lie Cys lie Tyr Leu Asn Ser Ala lie Asn Pro Val 

260 265 270 

lie Tyr Asn lie Met Ser Gin Lys Arg Phe Ala Ala Phe Arg Lys Leu 
275 280 285 

Cys Asn Cys Lys Gin Lys Pro Thr Glu Lys Ala Ala Asn Tyr Ser Val 
25 290 295 300 

Ala Leu Asn Tyr Ser Val lie Lys Glu Ser Asp Arg Phe Ser Thr Glu 
305 310 315 320 

Leu Glu Asp lie Thr Val Thr Asp Thr Tyr Val Ser Thr Thr Lys Val 
325 330 335 

3 0 Ser Phe Asp Asp Thr Cys lie Ala Ser Glu Asn 

340 345 

(2) INFORMATION FOR SEQ ID NO : 4 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 amino acids 

3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 : 

4 0 Leu Ala Leu Trp Ala Thr Ala Tyr Leu Ala Leu Val Leu Va. Ala Val 

15 10 15 

Thr Gly Asn Ala lie Val lie Trp lie He Leu Ala His Arg Arg Met 
20 25 30 

Arg Thr Val Thr Asn Tyr Phe He Val Asn He Ala Leu Ala Asp Leu 
45 35 - 40 45 

Leu Asn Ala Ala Phe Asn Phe Val Tyr Ala Ser His Asn He Trp Tyr 
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50 55 60 

Phe Gly Arg Ala Phe Cys Tyr Phe Gin Asn Leu Phe Pro lie Thr Ala 
65 70 75 80 

Met Phe Val Ser lie Tyr Ser Met Thr Ala lie Ala Ala Asp Arg Tyr 

5 85 90 95 

Met Ala lie Val His Pro Phe Gin Pro Arg Leu Ser Ala Pro Ser Thr 
100 105 110 

Lys Ala Val lie Ala Gly lie Trp Leu Val Ala lie Lys Leu Ala Phe 
115 120 125 

10 Pro Gin Cys Phe Tyr Ser Thr Val Thr Met Gin Gly Ala Thr Lys Cys 

130 135 140 

Val Val Ala Trp Pro Glu Asp Ser Gly Gly Lys Thr Leu Leu Leu Tyr 
145 150 155 160 

His Leu Val Val lie Ala Leu lie Tyr Phe Leu Pro lie Ala Leu Ala 
15 165 170 175 

Tyr Ser Val lie Gly Leu Thr Leu Trp Arg Arg Ala Val Pro Gly His 
180 185 190 

Gin Ala His Gly Ala Asn Leu Arg His Leu Gin Ala Lys Lys Lys Phe 
195 200 205 

2 0 Val Lys Thr Met Val Leu Val Val Val Thr Phe Ala lie Cys Trp Leu 

210 215 220 

Pro Tyr His Leu Tyr Phe lie Leu Gly Ser Phe Gin Glu Asp lie Tyr 
225 230 235 240 

Cys His Lys Phe lie Gin Gin Val Tyr Leu Ala Leu Phe Trp Leu Ala 
25 245 250 255 

Met Ser Ser Thr Met Tyr Asn Pro lie lie Tyr Cys Cys Leu Asn His 
260 265 270 

Arg Phe Arg Ser Gly Phe Axg Leu Ala Phe Arg Cys Cys Pro Trp Val 
275 280 285 

3 0 Thr Pro Thr Lys Glu Asp Lys Leu Glu Leu Thr Pro Thr Thr Ser Leu 

290 295 300 

Ser Thr Arg Val Asn Arg Cys His Thr Lys Glu Thr Leu Phe Met Ala 
305 310 315 320 

Gly Asp Thr Ala Pro Ser Glu Ala Thr Ser Gly Glu Ala Gly Arg Pro 
35 325 330 335 

Gin Asp Gly Ser Gly 
340 

(2) INFORMATION FOR SEQ ID NO : 4 9 : 
(i) SEQUENCE CHARACTERISTICS : 

4 0 (A) LENGTH : 34 0 amino acids 



SEQUENCE DESCRIPTION: SEQ ID NC : 4 h : 

Val Leu Trp Ala Ala Ala Tyr Thr Val He Val Val Arg Se: 
5 10 15 
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Val Gly Asn Val Val Val lie Trp lie lie Leu Ala His Lys Arg Met 

20 25 30 

Arg Thr Val Thr Asn Tyr Phe Leu Val Asn lie Ala Phe Ala Phe Ala 
35 40 45 

5 Leu Asn Thr Trp Asn Phe Thr Tyr Ala Val His Asn Val Trp Tyr Tyr 

50 55 60 

Gly Leu Phe Tyr Cys Lys Phe His Asn Phe Phe Pro lie Ala Ala Leu 
65 70 75 80 

Phe Ala Ser lie Tyr Ser Met Thr Ala Val Ala Phe Asp Arg Tyr Leu 
10 85 90 95 

lie lie His Pro Leu Gin Pro Arg Leu Ser Ala Thr Ala Thr Lys Val 

100 105 110 

Val lie Phe Val lie Trp Val lie Ala Leu Leu Leu Ala Ser Pro Gin 
115 120 125 

15 Gly Tyr Tyr Ser Thr Thr Glu Leu Ser Arg Val Val Cys Met lie Glu 

130 135 140 

Trp Pro Glu His Pro Asn Arg Thr Tyr Glu Lys Ala Tyr Hi^ lie Cys 
145 150 155 160 

Val Thr Val Leu lie Tyr Phe Leu Pro Leu Leu Val He Gly Tyr Ala 
20 165 170 175 

Tyr Thr Val Val Gly He Thr Leu Trp Ala Ser Glu He Pro Gly Asp 
180 185 190 

Ser Ser Asp Arg Tyr His Glu Gin Val Ser Ala Lys Arg Lys Val Val 
195 200 205 

2 5 Lys Met lie Cys Val Val Val Cys Thr Phe Ala He Cys Trp Leu Pro 

210 215 220 

Phe His Val Phe Phe Leu Leu Pro Tyr He Asn Pro Asp Leu Tyr Leu 

225 230 235 240 

Lys Lys Phe He Gin Gin Val Tyr He Ala Ser Met Trp Leu Ala Met 
30 245 250 255 

Ser Ser Thr Met Tyr Asn Pro He He Tyr Cys Cys Leu Asn Asp Arg 

260 265 270 

Phe Arg Leu Gly Phe Lys His Ala Phe Arg Cys Cys Pro Phe He Ser 
275 280 285 

3 5 Ala Gly Asp Tyr Glu Gly Leu Glu Met He Lys Ser Thr Arg Tyr Leu 

290 ' " 295 300 

Gin Thr Leu Ser Ser Val Tyr Lys Val Ser Arg Leu Glu Thr Thr He 
305 310 315 320 

Ser Thr Val Val Gly Ala His Glu Glu Glu Pro Glu Glu Gly Pro Lys 
40 325 330 335 

Ala Thr Pro Ser 
340 

(2) INFORMATION FOR SEQ ID NO: 50: 
(l) SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 33 6 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
5 lie Ala Leu Trp Ser Leu Ala Tyr Gly Leu* Val Val Ala Val Ala Val 

15 10 15 

Phe Gly Asn Leu lie Val lie Trp lie lie Leu Ala His Lys Arg Met 
20 25 30 

Arg Thr Val Thr Asn Tyr Phe Leu Val Asn Leu Ala Phe Ser Asp Ala 
10 35 40 45 

Ser Val Ala Ala Phe Asn Thr Leu lie Asn Phe lie Tyr Gly Leu His 
50 55 60 

Ser Glu Trp Tyr Phe Gly Ala Asn Tyr Cys Arg Phe Gin Asn Phe Phe 
65 70 75 80 

!5 Pro lie Thr Ala Val Phe Ala Ser lie Tyr Ser Met Ala lie Ala Val 

85 90 95 

Asp Arg Tyr Met Ala lie lie Asp Pro Leu Lys Pro Arg Leu Ser Ala 
100 105 110 

Thr Ala Thr Lys lie Val lie Gly Ser He Trp He Leu Ala Phe Leu 
20 115 120 125 

Leu Ala Phe Pro Gin Cys Leu Tyr Ser Lys He Leu Gly Arg Thr Leu 
130 135 140 

Cys Tyr Val Trp Pro Glu Gly Pro Lys Gin His Phe Thr Tyr His He 
145 150 155 160 

25 He Val He lie Leu Val Tyr Cys Phe Pro Leu Leu He Leu Thr Tyr 

165 170 175 

Thr He Val Gly He Thr Leu Trp Gly Gly Glu He Pro Gly Asp Thr 
180 185 190 

Cys Asp Lys Tyr His Glu Gin Leu Lys Ala Lys Arg Lvs Val Val Met 
30 195 200 205 

Asn He Val Val Val Thr Phe Ala He Cys Trp Leu Pro Tyr His Val 
210 215 220 

Tyr Phe lie Leu Thr Ala He Tyr Gin Gin Leu Asn Arg Trp Lys Tyr 
225 230 235 240 

3 5 He Gin Gin Val Tyr Leu Ala Ser Phe Trp Leu Ala Met Ser Ser Thr 

245 250 255 

Met Tyr Asn Pro He lie Tyr Cys Cys Leu Asn Lys Arg Phe Arq Ala 
260 265 270 

Gly Phe Lys Arg Ala Phe Arg Trp Cys Pro Phe lie Gin Val Ser Ser 

40 275 280 285 



?iv Asp Pre -hi lvs Ser Ser Arg Lvs Lvs Arg Ala Val 
325 330 " 335 
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(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met lie Pro Thr Leu Tyr Ser He He Phe Val Val Gly He Phe Gly 

10 1 5 10 15 

Asn Ser Leu Val Val He Val He Tyr Phe Tyr Met Lys Leu Lys Thr 
20 25 30 

Tyr Ala Ser Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe 
35 40 45 

15 Leu Leu Thr Leu Pro Leu Trp Ala Val Tyr Thr Leu Tyr Arg Trp Pro 

50 55 60 

Phe Gly Asn Tyr Leu Cys Lys He Ala Ser Ala Ser Val Ser Phe Asn 

65 70 75 80 

Leu Tyr Ala Ser Val Phe Leu Leu Thr Cys Leu Ser He Asp Arg Tyr 
20 85 90 95 

Leu Ala He Val His Pro Met Lys Ser Arg Leu Arg Arg Leu Val Ala 
100 105 110 

Lys Val Thr Cys He He He Trp Leu Leu Ala Gly He Ala Ser Leu 
115 120 125 

2 5 Pro Thr He He His Arg Asn Phe Phe lie Glu Asn Thr Asn He Thr 

130 135 140 

Val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro Val Gly 
145 150 155 160 

Leu Gly Leu Thr Lys Asn He Leu Gly Phe Leu Phe Pro Phe Leu lie 
30 165 170 175 

lie Leu Thr Ser Tyr Thr Leu He Trp Lys Thr Leu Lys Lys Ala Tyr 
180 185 ISj 

Glu He Gin Lys Asn Lys Pro Arg Lys Asp Asp lie Phe Lys lie lie 
195 200 205 

3 5 lie Ala He Val Leu Phe Phe Phe Phe Ser Trp Val Pro His Asn He 

210 215 220 

Phe Thr Phe Met Val Leu He Gin Leu Gly Leu lie Arg Asp Cys Lys 
225 230 ' 235 240 

He Glu Asp He Val Asp Thr Ala Met Pro He Thr He Cys Leu Ala 
40 245 250 255 

Tyr Phe Gin Gin Asn Leu Asn Pro Leu Phe Tyr Gly Phe Leu Gly Lys 
260 265 " " 270 

Lys Phe Lys Lys Tyr Phe Leu His Ala Leu Leu Lys Tyr He Pro Pro 
275 280 285 

4 5 Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser Thr Leu Ser 

290 295 300 
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Tyr Arg Pro Ser Glu Gin Gly Asn Ser Ser Thr Lys Lys Pro Ala Pro 
305 310 315 320 

Cys lie Glu Val Glu 
325 

5 (2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

He Val His Trp Val He Met Ser He Ser Pro Val Gly Phe Val Glu 
15 10 15 

15 Asn Gly He Leu Leu Trp Phe Leu Cys Phe Phe Thr Val Tyr Thr His 

20 25 30 

Leu Ser He Ala Asp lie Ser Leu Leu Phe Cys He Phe He Leu Ser 

O C a r\ ac 

— ' — ' 1U ~i ~i 

lie Asp Tyr Ala Leu Asp Tyr Glu Leu Ser Ser Gly His Tyr Tyr Thr 
20 50 55 60 

He Val Thr Leu Ser Val Thr Phe Leu Phe Gly Tyr Asn Thr Gly Leu 
65 70 75 80 

Tyr Leu Leu Thr Ala He Ser Val Glu Arg Cys Leu Ser Val Leu Tyr 
85 90 95 

2 5 Pro lie Trp Tyr Arg Cys His Arg Pro Lys Tyr Gin Ser Ala Leu Val 

100 105 110 

Cys Ala Leu Leu Trp Ala Leu Ser Cys Leu Val Thr Thr Me^ Tyr Val 
115 120 125 

Met Cys He Asp Arg Phe Glu Glu Ser His Ser Arg Asn Asp Cys Arg 
30 130 135 140 

Ala Val He He Phe He Ala He Leu Ser Phe Leu Val Phe Thr Pro 
145 150 155 160 

Ser Val Ser Ser Thr He Leu Val Val Lys lie Arg Lys Asn Thr Trp 

165 170 175 

3 5 Ala Ser His Ser Ser Lys Leu Tyr He Val lie Met Val Thr lie He 

180 185 190 

He Phe Leu He Phe Ala Met Pro Met Arg Leu Leu Tyr Leu Leu Tyr 
195 200 205 

Tyr Glu Tyr Trp Ser Thr Phe Gly Asn Leu His His He Ser Leu Leu 
40 210 215 ' 220 

Phe Ser Thr He Asn Ser Ser Ala Asn Pro Phe lie Tyr Phe Phe Val 
~ 23 0 235 24 0 



2 £ C 2 6 b 

:ys Asn Thr Val Thr Val Glu Thr Val Val 
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275 280 

(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
10 Tyr Asp Phe Leu Arg Val Leu lie Trp Leu lie Asn lie Leu Ala lie 

15 10 15 

Met Gly Asn Val Met Thr Leu Phe Val Leu Leu Thr Ser Arg Tyr Lys 
20 25 30 

Leu Thr Val Pro Arg Phe lie Met Asn Leu Ser Phe Ala Asp Phe Cys 
15 35 40 45 

Met Leu Tyr Leu Leu Leu lie Ala Ser Val Asp Ser Gin Thr Lys Gly 
50 55 60 

Gin Tyr Tyr Asn His Ala He Asp Trp Gin Thr Gly Ser Gly Cys Ser 
65 70 75 80 

2 0 Thr Ala Gly Phe Phe Thr Val Leu Ala Ser Glu Leu Ser Val Tyr Thr 

85 90 95 

Leu Thr Val He Thr Leu Glu Arg Trp His Thr He Thr Tyr Ala He 
100 105 110 

His He Asp Gin Lys Leu Arg Leu Arg His Ala He Leu He Met Leu 
25 115 120 125 

Gly Gly Trp Leu Phe Ser Ser Leu He Ala Met Leu Pro Leu Val Cys 
130 135 140 

Val Ser Asn Tyr Met Lys Val Ser He Cys Leu Pro Met Val Glu Thr 
145 150 155 160 

3 0 Thr Leu Ser Gin Val Tyr He Leu Thr He Leu He Leu Asn Val Val 

165 170 175 

Ala Phe Leu He He Cys Ala Cys Tyr lie Lys He Tyr Phe Ala Val 
180 185 19," 

Arg Asn Pro Glu He Met Ala Thr Asn Lys Asp Thr Lys He Ala Leu 
35 195 200 205 

Ala He Leu He Phe Thr Asp Phe Thr Cys Met Pro He Ser Phe Phe 
210 215 220 

Ala lie Ser Ala Ala Phe Lys Val Pro Leu lie Val Thr Asn Ser Lys 
225 230 235 240 

4 0 Val Leu Leu Val Leu Phe Tyr Pro He Asn Ser Cys Ala Asn Pro Phe 

245 250 255 

Leu Tyr Ala He Phe Thr Lys Thr Phe Gin Arg Asp Phe Phe lie Leu 
260 265 270 

Ser Lys Phe Cys Cys Lys Arg Arg Ala Asp He Tyr Arg Ar^ Lys Asp 
45 275 280 285 

Phe Ser Ala Tyr Thr Ser Asn Cys Lys Lys Gly Phe Thr Gly Ser Asn 
290 295 300 
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Lys Pro Ser Gin Ser Thr Leu Lys Leu Ser Thr Leu His Cys Gin Gly 
305 310 315 320 

Thr Ala Leu Leu Asp Lys Arg Arg Tyr Thr Glu Cys 
325 330 

5 (2) INFORMATION FOR SEQ ID NO: 54: 
(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Tyr Lys Phe Leu Arg lie Val Val Trp Phe Val Ser Leu Leu Ala Leu 
15 10 15 

15 Leu Gly Asn Val Phe Val Leu Leu lie Leu Leu Thr Ser His Tyr Lys 

20 25 30 

Leu Asn Val Pro Arg Phe lie Met Asn lie Ala Phe Ala Asp Phe Cys 
35 40 45 

Met Met Tyr Leu Leu Leu lie Ala Ser Val Asp Leu Tyr Thr His Ser 
20 50 55 60 

Glu Tyr Tyr Asn His Ala lie Asp Trp Gin Thr Gly Pro Gly Cys Asn 
65 70 75 80 

Thr Ala Gly Phe Phe Thr Val Phe Ala Ser Glu Leu Ser Val Tyr Thr 
85 90 95 

2 5 Leu Thr Val lie Thr Leu Glu Arg Trp Tyr Ala lie Thr Phe Ala Met 

100 105 110 

Arg Leu Asp Arg Lys lie Arg Leu Arg His Ala Cys Ala lie Met Val 
115 120 125 

Gly Gly Trp Val Cys Cys Phe Leu Leu Ala Leu Leu Pro Leu Val Gly 
30 130 135 140 

lie Ser Ser Tyr Ala Lys Val Ser lie Cys Leu Pro Met Thr Glu Thr 
145 150 155 160 

Pro Leu Ala Leu Ala Tyr lie Val Phe Val Leu Thr Leu Asn lie Val 

155 170 175 

3 5 Ala Phe Val lie Val Cys Cys Cys Tyr Val Lys lie Tyr lie Thr Val 

180 185 190 

Arg Asn Pro Gin Tyr Asn Pro Gly Asp Lys Asp Thr Lys lie Ala Lys 
195 200 205 

Arg Met Ala Val Leu lie Phe Thr Asp Phe lie Cys Met Ala Pro lie 
40 210 215 220 

Ser Phe Tyr Ala Leu Ser Ala lie Leu Asn Lys Pro Leu lie Thr Val 

2 2 5 2 3 0 2 3 5 7 4 0 



265 
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Val Phe lie Leu Leu Ser Lys Phe Gly lie Cys Lys Arg Gin Ala Gin 
275 280 285 

Ala Tyr Arg Gly Gin Arg Val Pro Pro Lys Asn Ser Thr Asp lie Gin 
290 295 300 

5 Val Gin Lys Val Thr His Asp Met Arg Gin Gly Ala Leu Asn Met Glu 

305 310 315 320 

Asp Val Val Glu Leu lie Glu Asn Ser His Leu Thr Pro Lys Lys Gin 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 55: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Tyr Asn lie Leu Arg Val Leu lie Trp Phe lie Ser lie Leu Ala lie 

15 10 15 

Thr Gly Asn lie lie Val Leu Val lie Leu Thr Thr Ser Gin Tyr Lys 
20 20 25 30 

Leu Thr Val Pro Arg Phe Leu Met Asn lie Ala Phe Ala Asp Leu Cys 
35 40 45 

lie Gly He Tyr Leu Leu Leu He Ala Ser Val Asp He His Thr Lys 
50 55 60 

25 Ser Gin Tyr His Asn Tyr Ala He Asp Trp Gin Arg Gly Ala Gly Cys 

65 70 75 80 

Asp Ala Ala Gly Phe Phe Thr Val Phe Ala Ser Glu Leu Ser Val Tyr 

85 90 95 

Thr Leu Thr Ala He Thr Leu Glu Arg Trp His Thr He Thr His lie 
30 100 105 110 

Met Gin He Asp Cys Lys Val Gin Leu Arg His Ala Ala Ser Val Met 
115 120 125 

Val Met Gly Trp He Phe Ala Phe Ala Ala Ala Leu Phe Pro He Phe 
130 135 140 

3 5 Gly He Ser Ser Tyr Met Lys Val Ser He Cys Leu Pro Leu lie Asp 

145 150 155 160 

Ser Pro Leu Ser Gin Leu Tyr Val Met Ser Leu Leu Val Leu Asn Val 
165 170 175 

Leu Ala Phe Val Val He Cys Gly Cys Tyr Thr His He Tyr Leu Thr 
40 180 185 19j 

Val Arg Asn Pro A*sn He Val Ser Ser Ser Ser Asp Thr Arg He Ala 
195 200 205 

Lys Arg Met Leu He Phe Thr Asp Phe Leu Leu Pro He Ser Phe Phe 
210 215 220 

45 Ala He Ser Ala Ser Leu Lys Val Pro Leu He Thr Val Ser Lys Ala 

225 230 235 240 

Lys lie Leu Leu Val Leu Phe Hxs Pro lie Asn Ser Cys Ala Asn Pro 
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245 250 255 

Phe Leu Tyr Ala lie Phe Thr Lys Asn Phe Arg Arg Asp Phe Phe lie 

260 265 270 

Leu Leu Ser Lys Cys Gly Cys Tyr Glu Met Gin Ala Gin lie Tyr Arg 
5 275 280 285 

Thr Glu Thr Ser Ser Thr Val His Asn Thr His Pro Arg Asn Gly His 
290 295 300 

Cys Ser Ser Ala Pro Arg Val Thr Ser Gly Ser Ser Arg Tyr lie Leu 

305 310 315 320 

10 Val Pro Leu Ser Leu Gin Asn 

325 

(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 09 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

2 0 Ser Met Leu Ala Ala Tyr Met Phe Leu Leu lie Val Leu Gly Phe Pro 

15 10 15 

lie Asn Phe Leu Thr Leu Tyr Val Thr Val Gin His Lys Lys Leu Arg 
20 25 30 

Thr Pro lie Asn Tyr lie Leu Leu Asn Leu Ala Val Ala Asp Leu Phe 
25 35 40 45 

Met Val Leu Gly Gly Phe Thr Ser Thr Leu Tyr Thr Ser Leu His Gly 
50 55 60 

Tyr Phe Val Phe Gly Pro Thr Gly Cys Asn Leu Glu Gly Ph<- Phe Ala 
65 70 75 80 

3 0 Thr Leu Gly Gly Clu lie Ala Leu Trp Ser Leu Trp Leu Ala lie Glu 

85 SO 95 

Arg Tyr Val Val Val Cys Lys Pro Met Ser Asn Phe Arg Phe Gly Glu 
100 105 110 

Asn His Ala lie Met Gly Val Ala Phe Thr Trp Val Met Ala Leu Ala 

35 115 120 125 

Cys Ala Ala Pro Pro lie Ala Gly Trp Ser Arg Tyr lie Pro Glu Gly 
130 135 140 

Leu Gin Cys Ser Cys Gly lie Asp Tyr Tyr Thr Leu Lys Pro Glu Val 
145 150 155 16G 



4 0 Asn Asn Glu Ser Phe Val lie Tyr Met Phe Val Val His Phe Thr lie 

165 170 175 



G:u Val Thr A: g Me Va ^ Zie I - e Me : '.'a . lie 
210 " 215 22 C 
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Cys Trp Val Pro Tyr Ala Ser Val Ala Phe Tyr lie Phe Thr His Gin 

225 230 235 240 

Gly Ser Asn Phe Gly Pro lie Phe Met Arg lie Pro Ala Phe Phe Ala 
245 250 255 

5 Lys Ser Ala Ala lie Tyr Asn Pro Val lie Tyr lie lie Phe Asn Lys 

260 265 270 

Gin Phe Arg Asn Cys Met Leu Gin Leu lie Cys Cys Gly Lys Asn Pro 
275 280 285 

Leu Gly Asp Asp Glu Ala Ser Ala Thr Val Ser Lys Arg Glu Thr Ser 
10 290 295 300 

Gin Val Ala Pro Ala 
305 

(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 29 7 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met lie Phe Val Val lie Ala Ser Val Phe Thr Asn Gly Leu Val Leu 
15 10 15 

Ala Ala Thr Met Lys Phe Lys Lys Leu Pro His Pro lie Asn Trp lie 
20 25 30 

2 5 Leu Val Asn Leu Ala Val Ala Asp lie Ala Gly Thr Val lie Ala Ser 

35 40 45 

Thr lie Ser Val Val Asn Gin Val Tyr Gly Tyr Phe Val Leu Gly His 

50 55 60 

Pro Met Cys Val Leu Glu Gly Tyr Thr Val Ser Leu Cys Gly lie Thr 
30 65 70 75 80 

Gly Leu Trp Ser Leu Ala lie lie Ser Trp Glu Arg Trp Met Val Val 
85 90 95 

Cys Lys Pro Phe Gly Asn Val Arg Phe Asp Ala Lys lie Ala lie Val 
100 105 110 

3 5 Gly lie Ala Phe Ser Trp lie Trp Ala Ala Val Trp Thr Aid Pro Pro 

115 120 125 

lie Phe Gly Trp Ser Arg Tyr Trp Pro His Gly Leu Lys Thr Ser Cys 
130 135 140 

Gly Pro Asp Val Phe Ser Glv Ser Ser Tyr Pro Gly Val Gin Ser Leu 
40 145 150 155 160 

Leu Cys lie Thr Pro Leu Ser lie lie Val Leu Cys Tyr Leu Gin Val 
165 170 175 

Trp Thr Ala He Arg Ala Val Ala Lys Gin Gin Lys Glu Ser Glu Ser 
180 185 190 

4 5 Thr Gin Lys Ala Glu Lys Glu Val Thr Arg Met Trp Val Met Val Leu 

195 200 205 

Ala Phe Cys Phe Cys Trp Gly Pro Tyr Ala Phe Phe Ala Cys Phe Ala 
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210 215 220 

Ala Ala Asn Pro Gly Tyr Pro Phe His Pro Leu Met Ala Ala Leu Pro 
225 230 235 240 

Ala Phe Phe Ala Lys Ser Ala Thr lie Tyr Asn Pro Val lie Tyr Val 
5 245 250 255 

Phe Met Asn Arg Gin Phe Arg Asn Cys lie Leu Gin Leu Phe Gly Lys 
260 265 270 

Lys Val Asp Asp Gly Ser Glu Leu Ser Ser Ala Ser Lys Thr Glu Val 
275 280 285 

10 Ser Ser Val Ser Ser Val Ser Pro Ala 

290 295 

(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

2 0 Arg Cys Phe Val Val Thr Ala Ser Val Phe Thr Asn Gly Leu Val Leu 

15 10 15 

Ala Ala Thr Met Lys Phe Lys Lys Leu Arg His Pro Leu Asn Trp lie 
20 25 30 

Leu Val Asn lie Ala Val Ala Asp lie Ala Gly Thr Val lie Ala Ser 
25 35 40 45 

Thr lie Ser lie Val Asn Gin Val Ser Gly Tyr Phe Val Leu Gly His 
50 55 60 

Pro Met Cys Val Leu Glu Gly Tyr Thr Val Ser Leu Cys Gly lie Thr 
65 70 75 80 

3 0 Gly Leu Trp Ser Leu Ala lie lie Ser Trp Glu Arg Trp Leu Trp Cys 

85 90 95 

Lys Pro Phe Gly Asn Val Arg Phe Asp Ala Lys lie Ala lie Val Gly 
100 105 110 

He Ala Phe Ser Trp He Trp Ser Ala Val Trp Thr Ala Pro Pro He 
35 115 120 125 

Phe Gly Trp Ser Arg Tyr Trp Pro His Gly Leu Lys Thr Ser Cys Gly 
130 135 140 

Pro Asp Val Phe Ser Gly Ser Ser Tyr Pro Gly Val Gin Ser Leu Val 
145 150 155 160 

40 He Met Val Thr Cys Cys He He Pre He Ala He He Leu Cys Tyr 

165 170 175 



Pre 
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Ala Ala Asn Pro Gly Tyr Ala Phe His Pro Leu Met Ala Ala Leu Pro 
225 230 235 240 

Ala Tyr Phe Ala Lys Ser Ala Thr He Tyr Asn Pro Val He Tyr Val 
245 250 255 

5 Phe Met Asn Arg Gin Phe Arg Asn Cys He Leu Gin Leu Phe Gly Lys 

260 265 270 

Lys Val Asp Asp Gly Ser Glu Leu Ser Ser Ala Ser Lys Thr Glu Val 
275 280 285 

Ser Ser Val Ser Ser Val Ser Pro Ala 
10 290 295 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

Gin Ala Ala Phe Met Gly Thr Val Phe Leu He Gly Phe Pro Leu Leu 
20 1 5 10 15 

Val Ala Thr Leu Ala Tyr Lys Lys Leu Arg Gin Pro Asn Tyr He Leu 
20 25 30 

Val Asn Val Ser Phe Gly Gly Phe Leu Leu Cys He Phe Ser Val Phe 
35 40 45 

2 5 Pro Val Phe Val Ala Ser Cys Asn Gly Tyr Phe Val Phe Gly Arg His 

50 55 60 

Val Cys Ala Leu Glu Gly Phe Leu Gly Thr Val Ala Gly Leu Val Thr 

65 70 75 80 

Gly Trp Ser Leu Ala Phe Leu Ala Phe Glu Arg Tyr He Val He Cys 
30 85 90 95 

Lys Pro Phe Gly Asn Phe Arg Phe Ser Ser Lys His Ala Leu Thr Val 
100 105 110 

Val He Ala Thr Trp Thr He Gly He Gly Val Ser He Pro Pro Phe 
115 120 125 

3 5 Phe Gly Trp Ser Arg Phe He Pro Glu Gly Leu Gin Cys Ser Cys Gly 

130 135 140 

Pro Asp Lys Tyr Thr Val Gly Thr Lys Tyr Arg Ser Glu Ser Tyr Thr 
145 150 155 160 

Trp Phe Leu Phe He Phe Cys Phe He Val Pro Leu Ser Leu He Cys 
40 165 170 175 

Phe Ser Tyr Thr Gin Leu Leu Arg Ala Leu Lys Ala Val Ala Ala Gin 
180 185 190 

Gin Gin Glu Ser Ala Thr Thr Gin Lys Ala Glu Arg Glu Val Ser Arg 
195 200 205 

4 5 Met Val Val Val Met Val Gly Ser Phe Cys Val Cys Tyr Val Pro Tyr 

210 215 220 

Ala Ala Phe Ala Met Tyr Met Val Asn Asn Arg Asn His Gly Leu Asp 
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225 230 235 240 

Leu Arg Leu Val Arg lie Pro Ser Phe Phe Ser Lys Ser Ala Cys lie 
245 250 255 

Tyr Asn Pro lie lie Tyr Cys Phe Met Asn Lys Gin Phe Gin Ala Cys 
5 260 265 270 

lie Met Met Val Cys Gly Lys Ala Met Met Glu Ser Asp Thr Cys Ser 
275 280 285 

Ser Gin Lys Thr Glu Val Ser Thr Val Ser Ser Thr Gin Val Gly Pro 
290 295 300 

10 Asn 
305 

(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 
15 (B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
(n) MOLECUIiE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

2 0 Leu lie Tyr Gly Leu Phe Leu Ser Met Tyr Leu Val Thr Val lie Gly 

15 10 15 

Asn lie Ser lie lie Val Ala lie lie Ser Asp Pro Cys Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Asn Leu Ser Phe Val Asp lie Cys Phe 
25 35 40 45 

He Ser Thr Thr Val Pro Val Asn Thr Gin Thr Gin Asn Asn Val He 
50 55 60 

Thr Tyr Ala Gly Cys lie Thr Gin He Tyr Phe Phe Leu Leu Phe Val 
65 70 75 80 

3 0 Glu Leu Asp Asn Phe Leu Leu Thr He Met Ala Tyr Asp Arg Tyr Val 

85 90 95 

Ala He Cys His Pro Met His Tyr Thr Val He Met Asn Tyr Lys Leu 
100 105 110 

Cys Gly Phe Leu Val Leu Val Ser Trp He Val Ser Val Leu His Ala 
35 115 120 125 

Leu Phe Gin Ser Leu Ala Leu Pro Phe Cys Thr His Leu Glu He Pro 
130 135 140 

Hxs Tyr Phe Cys Glu Pro Asn Gin Val He Gin Leu Thr Cys Ser Asp 
145 150 155 160 

4 0 Ala Phe Leu Asn Asp Leu Val He Tyr Phe Thr Leu Val Leu Leu Ala 

165 170 175 

Thr Val Pro He Ala Glv He Phe Tvr Ser Tvr Phe Ala He Ser Ser 



val 
210 



jeu Pne 
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Ala Ala Asn Asn Ser Leu Ser Ala Thr Ala Ser Val Met Tyr Thr Val 

225 230 235 240 

Val Thr Pro Met Val Asn Pro Phe lie Tyr Ser Leu Arg Asn Lys Asp 
245 250 255 

5 Val Lys Ser Val Leu Lys Lys Thr Leu Cys Glu Glu Val lie Arg Ser 

260 265 270 

Pro Pro Ser Leu Leu His Phe Phe Leu Val Leu Cys His Leu Pro Cys 
275 280 285 

Phe lie Phe Cys Tyr 
10 290 

(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 84 amino acids 

(B) TYPE: amino acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Leu Leu Phe Leu Leu Phe Leu lie Met Tyr Leu Ala Thr Val Leu Gly 
20 1 5 10 15 

Asn Leu Leu lie lie Leu Ala lie Gly Gly Asp Ser Arg Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Asn Leu Ser Phe Val Asp Val Cys Phe 
35 40 45 

2 5 Ser Ser Thr Thr Val Pro Lys Val Leu Ala Asn His lie Leu Gly Ser 

50 55 60 

Gin Ala lie Ser Phe Ser Gly Cys Leu Thr Gin Leu Tyr Phe Leu Ala 
65 70 75 80 

Val Phe Gly Asn Met Asp Asn Phe Leu Leu Ala Val Met Ser Tyr Asp 
30 85 90 95 

Arg Tyr Val Ala lie Cys His Pro Leu His Tyr Thr Thr He Arg Gin 
100 105 110 

Leu Cys Val Leu Leu Val Val Gly Ser Trp Val Val Ala Asn Met Asn 
115 120 125 

3 5 Cys Leu Leu His He Leu He Met Ala Arg Lys Ser Phe Cys Ala Asp 

130 135 140 

Leu Pro His Phe Phe Cys Asp Gly Thr Pro Leu Leu Lys Leu Ser Cys 
145 150 155 160 

Ser Asp Thr His Leu Asn Glu Leu Met He Leu Thr Glu Gly Ala Val 
40 165 170 175 

Val Met Val Thr Pro Phe Val Cys He Leu He Ser Tyr He His He 
180 185 190 

Thr Cys Ala Val Leu Arg Val Ser Ser Pro Arg Gly Gly Trp Lys Ser 
195 200 205 

4 5 Phe Ser Thr Cys Cly Ser His He Ala Val Val Cys Leu Phe Tyr Gly 

210 215 220 

Thr Val He Ala Val Tyr Phe Asn Pro Ser Ser Ser His Leu Ala Gly 
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225 230 235 240 

Arg Asp Met Ala Ala Ala Val Met Tyr Ala Val Val Thr Pro Met lie 
245 250 255 

Asn Pro Phe lie Tyr Ser Leu Arg Asn Ser Asp Met Lys Ala Ala Leu 
5 260 265 270 

Arg Lys Val Leu Ala Met Arg Phe Pro Ser Lys Gin 
275 280 

(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Leu Leu Phe Leu Leu Phe Leu Val Met Tyr Leu Leu Thr Val Val Gly 
15 10 15 

Asn Leu Ala lie lie Ser Leu Val Gly Ala His Arg Cys Leu Gin Pro 
20 25 30 

2 0 His Thr Pro Met Tyr Phe Phe Leu Cys Asn Leu Ser Phe Leu Glu lie 

35 40 45 

Trp Phe Thr Thr Ala Cys Val Pro Lys Thr Leu Ala Thr Phe Ala Pro 
50 55 60 

Arg Gly Gly Val lie Ser Leu Ala Gly Cys Ala Thr Lys Tyr Phe Val 
25 65 70 75 80 

Phe Ser Leu Gly Cys Thr Glu Tyr Phe Leu Leu Ala Val Met Ala Tyr 
85 90 95 

Asp Arg Tyr Leu Ala lie Cys Leu Pro Leu Arg Tyr Gly Gly lie Met 
100 105 110 

3 0 Arg Pro Gly lie Ala Met Arg Leu Ala Leu Gly Ser Trp Leu Cys Gly 

115 120 125 

Phe Ser Ala He Thr Val Pro Ala Thr Leu He Ala Arg Leu Ser Phe 
130 135 140 

Cys Gly Ser Arg Val lie Asn His Phe Phe Cys Asp He Ser Pro Trp 
35 145 150 155 160 

He Val Leu Ser Cys Thr Asp Thr Gin Val Val Glu Leu Val Ser Phe 
165 170 175 

Glv He Ala Phe Cvs Val lie Leu Gly Ser Cys Gly lie Thr Leu Val 
180 185 190 

4 0 Ser Tyr Ala Lys He Pro Ser Ala Arg Gly Arg His Arg Ala Phe Ser 

195 200 205 



Lys Ala He Thr Val Leu Asr: Thr He Val Thr Pro Val Leu Asn Pre 
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245 250 255 

Phe lie Tyr Thr Leu Arg Asn Lys Asp Val Lys Glu Ala Leu Arg Arc 
260 " 265 270 

Thr Val Lys Gly Lys 
5 275 

(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
10 <C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu lie Phe Ala Leu Phe Leu Ser Met Tyr Leu Val Thr Val Leu Gly 
15 1 5 10 15 

Asn Leu Leu He He Met Ala He He Thr Gin Ser His Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Phe Val Asp He Cys Phe Thr Ser Thr 
35 40 45 

2 0 Thr He Pro Leu Val Asn He Tyr Thr Gin Ser Lys Ser He Thr Tyr 

50 55 60 

Glu Asp Cys He Ser Leu Val Phe Ala Glu Leu Gly Asn Phe Leu Leu 
65 70 75 80 

Ala Val Met Ala Tyr Asp Arg Tyr Val Ala Xaa Cys His Pro Leu Cys 
25 85 90 95 

Tyr Thr Val He Val Asn His Arg Leu Cys He Leu Leu Le- Leu Leu 
100 105 110 

Ser Trp Val He Ser He Phe Arg Ala Phe He Gin Ser Leu He Val 
115 120 125 

3 0 Leu Gin Leu Thr Phe Cys Gly Asp Val Lys He Pro His Phe Phe Cys 

130 135 140 

Glu Leu Asn Gin Leu Ser Gin Leu Thr Cys Ser Asp Asn Phe Pro Ser 
145 150 155 160 

His Leu He Met Asn Leu Val Pro Val Met Leu Ala Ala He Ser Phe 
35 165 170 175 

Ser Gly He Leu Tyr Ser Tyr Phe Ser He Ser Thr Val Gin Gly Lys 
180 185 190 

Tyr Lys Ala Phe Ser Thr Cys Ala Ser His Leu Ser He Val Ser Leu 
195 200 205 

4 0 Phe Tyr Ser Thr Gly Leu Gly Val Tyr Val Ser Ser Ala Val Val Gin 

210 215 220 

Ser Ser His Ser Ala Ala Ser Ala Ser Val Met Tyr Thr Val Val Pro 

225 230 235 240 

Met Leu Asn Pro Phe lie Tyr Ser Leu Arg Asn Lvs Asp Val Lys Arg 
45 245 250 ' 255 

Ala Leu Glu Arg Leu Leu Glu Gly Asn Cys Lys Val His His Trp Thr 
260 265 ' " 270 
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Gly 



(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: peptide 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Phe Tyr Ala Leu Phe Leu Val Met Tyr Leu Thr Thr lie Leu Gly 
15 10 15 

Asn Leu Leu lie lie Val Leu Val Gin Leu Asp Ser Gin Leu His Thr 
20 25 30 

15 Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 

35 40 45 

Ser Ser Leu Lys Leu Leu Gin Asn Met Arg Ser Gin Asp Thr Ser lie 
50 55 60 

Pro Tyr Gly Gly Cys Leu Ala Gin Thr Tyr Phe Phe Met Val Phe Gly 
20 65 70 75 80 

Asp Leu Ser Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala 
85 90 95 

lie Cys Phe Leu Pro His Tyr Thr Ser lie Met Ser Pro Lys Leu Cys 
100 105 110 

25 Thr Cys Leu Val Leu Leu Leu Trp Met Leu Thr Thr Ser His Met Met 

115 120 125 

Thr Leu Leu Ala Ala Arg Leu Ser Phe Cys Glu Asn Asn Trp Leu Asn 
130 135 140 

Phe Phe Cys Asp Leu Phe Val Leu Leu Lys lie Ala Cys Ser Asp Thr 
30 145 150 155 160 

Tyr lie Asn Glu Leu Phe lie Met Ser Thr Leu Leu lie lie lie Pro 
165 170 175 

Phe Phe Leu lie Val Met Ser Tyr Ala Lys Val Pro Ser Thr Gin Gly 
180 185 190 

3 5 lie Cys Lys Val Phe Ser Thr Cys Gly Ser His Leu Ser Val Val Ser 

195 200 205 

Leu Phe Tyr Gly Thr lie He Gly Leu Tyr Leu Cys Pro Ala Gly Asn 
210 215 220 

Asn Ser Thr Val Lys Glu Met Val Met Ala Met Met Tyr Thr Val Val 
40 225 230 235 240 

Thr Pro Met He Asn Pro Phe He Tyr Ser Leu Arg Asn Arg Asp Leu 

245 250 255 



iA) LENGTH: ^86 ammo acias 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
5 Leu Leu Phe Phe Leu Ser Leu Leu Xaa Tyr Val Leu Val Leu Thr Glu 

15 10 15 

Asn Met Leu lie lie lie Ala lie Arg Asn His Pro Thr Leu His Lys 
20 25 30 

Pro Met Tyr Phe Phe Leu Phe Leu Glu lie Trp Tyr Val Thr Val Thr 
10 35 40 45 

lie Pro Lys Leu Met Gly Phe lie Gly Ser Lys Glu Asn His Gly Gin 
50 55 60 

Leu lie Ser Phe Phe Ala Cys Met Thr Gin Leu Tyr Phe Phe Leu Gly 

65 70 75 80 

15 Leu Gly Cys Thr Glu Cys Val Leu Leu Ala Val Met Ala Tyr Asp Arg 

85 90 95 

Tyr Val Ala lie Cys His Pro Leu His Tyr Pro Val lie Val Ser Ser 
100 105 110 

Arg lie Glx Val Leu Gly Ser Trp Ala Gly Gly Phe Gly lie Ser Met 
20 115 120 125 

Val Lys Val Phe Leu lie Ser Arg Leu Ser Tyr Cys Gly Pro Asn Thr 
130 135 140 

lie Asn His Phe Phe Cys Asp Val Ser Pro Leu Leu Asn Leu Ser Cys 
145 150 155 160 

2 5 Thr Asp Met Ser Thr Ala Glu Leu Thr Asp Phe Val lie Ala lie Phe 

165 170 175 

lie Leu Leu Gly Pro Leu Ser Val Thr Gly Ala Ser Tyr Met Arg lie 
180 185 190 

Pro Ser Ala Ala Gly Arg His Lys Ala Phe Ser Thr Cys Ala Ser His 

3 0 195 200 205 

Leu Thr Val Val lie lie Phe Tyr Ala Ala Ser lie Phe lie Tyr Ala 
210 215 220 

Arg Pro Lys Ala Leu Ser Ala Phe Thr Asp Asn Lys Leu Va.-. Ser Val 
225 230 235 240 

35 Leu Tyr Ala Val ^le Val Pro Leu Phe Asn Pro lie lie Tyr Cys Leu 

245 250 255 

Arg Asn Gin Asp Val Lys Arg Ala Leu Arg Arg Thr Leu His Leu Ala 

260 265 270 

Gin Asp Gin Glu Ala Asn Thr Asn Lys Gly Ser Lys lie Gly 
40 275 280 285 

(2) INFORMATION FOR SEQ ID NO: 66: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 6 : 

Leu Phe Phe Ala Leu Phe Leu lie Met Tyr Leu Thr Thr Phe Leu Gly 

15 10 15 

Asn Leu Leu lie Val Val Leu Val Gin Leu Asp Ser His Leu His Thr 
5 20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 
35 40 45 

Ser Ser Val Thr Met Leu Lys Leu Leu Gin Asn lie Gin Ser Gin Val 
50 55 60 

10 Pro Ser lie Ser Tyr Ala Gly Cys Leu Trp lie Phe Phe Phe Leu Leu 

65 70 75 80 

Phe Gly Tyr Leu Gly Asn Phe Leu Leu Val Ala Met Ala Tyr Asp Arg 
85 90 95 

Tyr Val Ala lie Cys Phe Pro Leu His Tyr Thr Asn lie Met Ser His 
15 100 105 110 

Lys Leu Cys Thr Cys Leu Leu Leu Val Phe Tn> lie Met Arg Ser Ser 
115 120 " 125 

His Ala Met Met lie Thr Leu lie Ala Ala Arg Leu Ser Phe Cys Glu 
130 135 140 

2 0 Asn Asn Val Leu Leu Asn Phe Phe Cys Asp Leu Phe Val Leu Leu Lys 

145 150 155 160 

Leu Ala Cys Ser Asp Thr Tyr Val Asn Glu Leu Met lie His lie Met 

-55 170 175 

Glu Val lie lie lie Val lie Pro Phe Val Leu lie Val He Ser Tyr 
25 180 185 190 

Ala Lys Val Pro Ser Thr Gin Ser He His Lys Val Phe Ser Thr Cys 
195 200 205 

Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr He He Gly 
210 215 220 

3 0 Leu Tyr Leu Cys Pro Ser Gly Asp Asn Phe Ser Leu Lys Gly Ser Leu 

225 230 235 240 

Thr Val Val Thr Pro He Met Pro Phe He Tyr Ser Leu Arg Asn Arg 

245 250 255 

Asp Met Lys Gin Ala Leu He Arg Val Thr Cys Ser Lys Lys He Ser 

35 260 265 270 

Leu Pro Trp 
275 

(2) INFORMATION FOR SEQ ID NO: 67: 
(i) SEQUENCE CHARACTERISTICS : 

4 0 {A) LENGTH: 284 amino acids 

( B } TYPE: amino acid 

(C) STRANDEDNESS : sinqle 
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Asn Leu lie lie lie lie Leu lie Leu Leu Asp Ser His Leu His Thr 
20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ala Asp Leu Cys Phe 
35 40 45 

5 Ser Ser Leu Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser lie 

50 55 60 

Pro Tyr Ala Gly Cys Leu Ala Gin lie Tyr Phe Phe Leu Phe Phe Gly 
65 70 75 80 

Asp Leu Gly Asn Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val 
10 85 90 95 

Ala lie Cys Phe Pro Leu His Tyr Met Ser lie Met Ser Pro Lys lie 
100 105 110 

Glx Val Ser Leu Val Val Leu Ser Trp Val Leu Thr Thr Phe His Ala 

115 120 125 

15 Met Leu His Thr Leu lie Met Ala Arg Leu Ser Phe Cys GL. Asp Ser 

130 135 140 

Val lie Pro His Tyr Phe Cys Asp Met Ser Thr Leu Leu Lys Val Ala 
145 150 155 160 

Cys Ser Asp Thr His Asp Asn Glu Leu Ala lie Phe lie Leu Gly Gly 
20 165 170 175 

Pro lie Val Val Leu Pro Phe Leu Leu lie lie Val Ser Tyr Ala Arg 
180 185 190 

lie Val Ser Ser lie Phe Lys Val Pro Ser Ser Gin Ser lie His Lys 
195 200 205 

2 5 Ala Phe Ser Thr Cys Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr 

210 215 220 

Gly Thr Val lie Gly Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Glu 
225 230 235 240 

Val Lys Glu Thr Val Met Ser lie Tyr Thr Met Val Pro Met Leu Asn 

30 245 250 255 

Pro Phe lie Tyr Ser Leu Arg Asn Arg Asp lie Lys Asp Ala Leu Glu 
260 265 270 

Lys lie Met Cys Lys Lys Gin lie Pro Ser Phe Leu 
275 280 

3 5 (2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : Single 

4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 8 : 

Leu Phe Tyr Ala Leu Phe Leu Ala Met Tvr Leu Thr lie lie Leu Gly 
1 5 10 15 

4 5 Asn Leu Leu lie lie Val Leu Val Arg Leu Asp Ser His Leu His Met 

20 25 " 30 
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Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe Ser Ser 
35 40 45 

Val Thr Trp Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser lie 
50 55 60 

5 Ser Tyr Thr Gly Cys Leu Thr Gin Leu Tyr Phe Phe Met Val Phe Gly 

65 70 75 80 

Asp Trp Ser Phe Leu Leu Val Val Met Ala Tyr Asp Arg Tyr Val Ala 
85 90 95 

lie Cys Phe Pro Leu Arg Tyr Thr Thr lie Met Ser Thr Lys Phe Cys 
10 100 105 110 

Ala Ser Leu Val Leu Leu Leu Trp Met Leu Thr Met Arg His Ala Leu 
115 120 125 

Leu His Thr Leu Leu He Ala Arg Leu Ser Phe Cys Glu Asp Ser Val 
130 135 140 

15 He Leu His Phe Phe Cys Asp He Ser Ala Leu Leu Lys Leu Ser Cys 

145 150 155 160 

Ser Asp He Tyr Val Asn Glu Leu Met He Tyr He Leu Gly Gly Leu 
165 170 175 

He lie He He Pro Phe Leu Leu He Val Met Ser Tyr Val Arg He 
20 180 185 190 

Phe Phe Ser lie Leu Lys Phe Pro Ser He Gin Asp He Ty- Lys Val 
195 200 205 

Phe Ser Thr Cys Gly Ser His Leu Ser Val Val Thr Leu Phe Tyr Gly 
210 215 220 

25 Thr He Phe Gly He Tyr Leu Cys Pro Ser Gly Asn Asn Ser Thr Val 

225 230 235 240 

Lys Glu He Leu Thr Val Val Thr Pro Met He Asn Pro Phe He Tyr 
245 250 255 

Ser Leu Arg Asn Arg Asp Trp Arg Ala Leu He Arg Val He Cys Thr 
30 260 265 270 

Lys Lys He Ser Leu 

275 



(2) INFORMATION FOR SEQ ID NO: 69: 
<i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 274 ammo acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
< D ) TOPOLOGY : 1 inear 

(n) MOLECULE TYPE: peptide 

4 0 (xi ) SEQUENCE DESCRIPTION; SEQ ID NO .69: 

Val Phe Tyr Ala Leu Phe Leu Ser Met Tyi 



be i 
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Ser Ser Leu Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser lie 
50 55 60 

Pro Phe Ala Gly Cys Leu Thr Gin Leu Tyr Phe Tyr Leu Tyr Phe Ala 
65 70 75 80 

5 Asp Leu Glu Ser Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val 

95 90 95 

Ala lie Cys Phe Pro Leu His Tyr Met Ser lie Met Ser Pro Lys Leu 
100 105 110 

Cys Val Ser Leu Trp Leu Ser Trp Val Leu Thr Thr Phe His Ala Met 
10 115 120 125 

Leu His Thr Leu lie Met Ala Arg Leu Ser Phe Cys Ala Asp Leu Pro 
130 135 140 

His Phe Phe Cys Asp lie Ser Pro Leu Leu Lys Leu Ser Cys Ser Asp 
145 150 155 160 

15 Thr His Val Asn Glu Leu Val lie Phe Leu Gly Leu Val He Val He 

165 170 175 

Pro Phe Val Leu He He Val Ser Tyr Ala Arg Val Val Ala Ser He 
180 185 190 

Leu Lys Val Pro Ser Val Arg Gly He His Lys He Phe Ser Thr Cys 
20 195 200 205 

Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr He He Gly 
210 215 220 

Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Thr Val Lys Glu Thr Leu 
225 230 235 240 

2 5 Thr Val Val Thr Pro Leu Pro Phe He Tyr Ser Leu Arg Asn Arg Asp 

245 250 255 

Met Lys Glu Ala Leu He Arg Val Leu Cys Lys Lys Lys lie Thr Phe 
260 265 270 



30 



Cys Leu 



(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

3 5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Leu Ala He Ala Val Leu Ser Leu Thr Leu Leu Gly Thr Phe Thr Val 

40 1 5 10 15 

Leu Glu Asn Leu Leu Val Leu Cys Val He Leu His Ser Arg Ser Leu 
20 25 30 

Arg Cys Arg Pro Ser Tyr His Phe He Gly Ser Leu Ala Val Ala Asp 
35 40 45 

4 5 Leu Leu Gly Ser Val He Phe Val Tyr Ser Phe Val Asp Phe His Val 

50 55 60 

Phe His Arg Lys Asp Ser Pro Asn Val Phe Leu Phe Lys Leu Gly Gly 

65 70 75 80 
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Val Thr Ala Ser Phe Thr Ala Ser Val Gly Ser Leu Phe Leu Thr Ala 
85 90 95 

lie Asp Arg Tyr lie Ser lie His Pro Pro lie Ala Tyr Lys Arg lie 
100 105 110 

5 Val Arg Arg Pro Lys Ala Val Val Ala Phe Cys Leu Met Thr lie Ala 

115 120 125 

lie Val lie Ala Val Leu Pro Leu Leu Gly Trp Asn Cys Lys Lys Leu 
130 135 140 

Gin Ser Val Cys Cys Asp lie Phe Pro Leu lie Asp Gly Thr Tyr Leu 
10 145 150 155 160 

Met Phe Trp lie Gly Val Thr Ser Val Leu Leu Leu Phe II.; Val Tyr 
165 170 175 

Ala Tyr Met Tyr lie Leu Trp Lys Ala His Ser His Ala Val Arg Ala 
180 185 190 

15 Gin Arg Gly Thr Gin Lys Ser lie lie lie His Thr Ser Glu Asp Gly 

iy b 200 205 

Lys Val Gin Val Thr Arg Pro Asp Gin Ala Arg Met Asp lie Arg Leu 
210 215 220 

Ala Lys Thr Leu Val Leu lie Leu Val Val Leu lie lie Cys Trp Gly 
20 225 230 235 240 

Pro Leu Leu Ala lie Met Val Tyr Asp Val Phe Gly Leu Leu lie Lys 
245 250 255 

Thr Val Phe Ala Phe Cys Ser Leu Leu lie Asn Ser Thr Val Asn Pro 
260 265 270 

25 lie lie Tyr Ala Leu Arg Ser Lys Asp Leu Arg His Ala Phe Arg Ser 

275 280 285 

Trp Pro Ser Cys Glu Gly Thr Ala Gin Pro Leu Asp Asn Ser Met Gly 

290 295 300 

Asp Ser Asp Cys Leu His Lys His Ala Asn Asn Thr Ala Ser Met His 
30 305 310 315 320 

Arg Ala Ala Glu Ser Cys lie Lys Ser Thr Val Lys Leu Ala Leu Val 

325 330 335 

Ser Thr Asp Thr Ser Ala Glu Ala Leu 

340 345 

3 5 (2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 349 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

4 0 ( D ) TOPOLOGY : 1 inear 

f i i ) MOLECULE TYPE: peptide 



'Jvs His 
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lie Ser Ala Thr Ser Leu Phe lie Val Asn Leu Ala Val Ala Asp lie 
35 40 45 

lie Glu Thr Leu Leu Asn Thr Pro Phe Thr Leu Val Arg Phe Val Asn 

50 55 60 

5 Ser Thr Trp Tyr Phe Gly Lys Gly Met Leu His Val Ser Arg Phe Ala 

65 70 75 80 

Gin Tyr Cys Ser Leu His Val Ser Ala Leu He Leu Thr Ala He Ala 
85 90 95 

Val Asp Arg His Gin Val He Met Pro Leu Lys Pro Arg He Ser He 
10 100 105 110 

Thr Lys Gly Val He Tyr He Ala Val lie Trp Val Met Thr Phe Phe 
115 120 125 

Ser Leu Pro His Ala He Cys Gin Lys Leu Phe Thr Phe Lys Tyr Ser 
130 135 140 

15 Glu Asp He Val Arg Ser Leu Cys Leu Asp Pro Phe Pro Glu Pro Ala 

14 5 150 155 160 

Asp Leu Phe Trp Lys Tyr Leu Asp He Ala Thr Phe lie Leu Leu Tyr 
165 170 175 

Leu Leu Pro Leu Phe lie lie Ser Val Ala Tyr Ala Arg Val Ala Lys 
20 180 185 190 

Lys Leu Trp Leu Cys Asn Thr lie Gly Asp Val Thr Thr Glu Gin Tyr 
195 200 205 

Leu Ala Leu Arg Arg Lys Lys Lys Thr Thr Val Lys Met Leu Val Leu 
210 215 220 

25 v al Val Val Leu Phe Ala Leu Cys Trp Phe Pro Leu Asn Cys Tyr Val 

225 230 235 240 

Leu Leu Leu Ser Ser Lys Ala lie His Thr Asn Asn Ala Leu Tyr Phe 
245 250 255 

Ala Phe His Trp Phe Ala Met Ser Ser Thr Cys Tyr Asn Pro Phe lie 
30 260 265 270 

Tyr Cys Trp Leu Asn Glu Asn Phe Arg Val Glu Leu Lys Ala Leu Leu 
275 280 285 

Ser Met Gin Pro Pro Pro Lys Pro Glu Asp Arg Leu Pro Ser Pro Val 
290 295 300 

35 Prc Ser Phe Arg Val Ala Trp Thr Glu Lys Ser His Gly Arg Arg Ala 

3 °5 310 315 320 

Pro Leu Pro Asn His His Leu Pro Ser Ser Gin lie Gin Ser Gly Lys 
^25 330 335 

Thr Asp Leu Ser Ser Val Glu Pro Val Val Ala Met Ser 
40 340 345 

(2) INFORMATION FOR SEQ ID NO: 72: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 

4 5 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 mear 
(ii) MOLECULE TYPE : peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

He Phe Thr He Ala Leu Ala Tyr Gly Ala Val He He Leu Gly Val 
15 10 15 

Ser Gly Asn Leu Ala Leu He He He He Leu Lys Gin Lys Glu Leu 
5 20 25 30 

He Leu He Val Asn Leu Ser Phe Ser Asp Leu Leu Val Ala Val Trp 
35 40 45 

Leu Pro Phe Thr Phe Val Tyr Thr Leu He Cys His Trp Val Phe Gly 
50 55 60 

10 Glu Cys Cys Lys Leu Asn Pro Phe Val Gin Cys Val Ser He Thr Val 

65 70 75 80 

Ser He Phe Ser Leu Val Leu He Ala Val Glu Arg His GL. Leu He 
85 90 95 

He Asn Pro Arg Gly Trp Arg Pro Asn Asn Arg His Ala Tyr He Gly 
15 100 105 110 

He Thr Val He Trp Val lie Ala Val Ala Ser Ser Leu Pro Phe Val 
115 120 125 

He Tyr Gin He Leu Thr Asp Glu Pro Phe Gin Asn Val Ser Leu Ala 
130 135 140 

2 0 Ala Phe Lys Asp Lys Tyr Val Cys Phe Asp Lys Phe Pro Ser Asp Ser 

145 150 155 160 

His Arg Leu Ser Tyr Thr Thr Leu Leu Leu Val Leu Gin Tyr Phe Gly 
165 170 175 

Pro Leu Cys Phe He Phe He Cys Tyr Phe Lys He Tyr He Arg Leu 
25 180 185 190 

Lys Arg Arg Asn Asn Met Met Lys He Arg Asp Ser Lys Tyr Arg Ser 
195 200 205 

Ser Glu Thr Lys Arg He Asn Val Met Leu Leu Ser He Val Val Ala 
210 215 220 

3 0 Phe Ala Val Cys Trp Leu Pro Leu Thr He Phe Asn He Va: Phe Asp 

225 230 235 240 

Trp Asn His Gin lie lie Ala Thr Cys Asn His Asn Leu Leu Phe Leu 
245 250 255 

Leu Cys His Leu Thr Leu Ser Thr Cys Val Asn Pro He Phe Tyr Gly 
35 260 265 270 

Phe Leu Asn Lys Asn Phe Gin Arg Asp Leu Gin Phe Phe Phe Asn Phe 

275 280 285 

Cys Asp Phe Arg Ser Arg Asp Gly Arg Thr Thr Arg Leu 

2S0 295 300 



T PANDKDNE ^ - :, 

\ L . TOPOLOGi : linear 
MOLECULE TYPE: peptide 
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(xx) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Leu Thr Ser Val Val Phe He Leu He Cys Cys Phe He He Leu Glu 

1 5 10 15 

Asn He Phe Val Leu Leu Thr He Trp Lys Thr Lys Lys Phe His Arg 
20 25 30 

Pro Met Tyr Tyr Phe He Gly Asn He Ala Leu Ser Asp Leu He Ala 
35 40 45 

Gly Val Ala Tyr Thr Ala Asn Leu Leu Leu Ser Gly Ala Thr Thr Tvr 
50 55 60 

Lys Leu Thr Pro Ala Gin Trp Phe Leu Arg Glu Gly Ser Met Phe Val 
65 70 75 80 

Ala Leu Ser Leu Ser Val Phe Ser Leu Leu Ala lie Ala He Glu Arg 
85 90 95 

Tyr He Thr Met Leu Lys Met Leu His Asn Gly Ser Asn Asn Phe Arg 
15 100 105 110 

Leu Phe Leu Leu He Ser Ala Cys Trp Val He Ser Leu He Leu Gly 
115 120 125 



10 



20 



Gly Leu Pro He Met Gly Trp Asn Cys He Ser Ala Leu Ser Ser Cys 
130 135 140 

Ser Thr Val Leu Pro Leu Tyr His Lys His Tyr He Leu Phe Cys Thr 
145 150 155 160 

Leu He Val Phe Thr Leu Leu Leu Leu Ser He Val He Leu Tyr Cys 
165 170 175 

Arg He Tyr Ser Leu Val Arg Thr Arg Ser Arg Arg Leu Thr Phe Arg 
Zb 180 185 190 

Lys Asn He Ser Lys Ala Ser Arg Ser Ser Glu Asn Val Ala Leu Leu 
195 200 205 



30 



Lys Thr Val He He Val Leu Ser Val Phe He Ala Cys Trp Ala Pro 
210 215 220 

Leu Phe He Leu Leu Leu Leu Asp Val Gly Cys Lys Val Lys Thr Cys 
225 230 235 240 

Asp He Leu Phe Arg Ala Glu Tyr Phe Leu Val He Ala Val He Asn 
245 250 255 

Ser G1 Y Th ^ Asn Pro He He Tyr Thr Leu Thr Asn Lys Glu Met Arq 
35 260 265 270 

Arg Ala Phe He Arg lie Met Cys Cys Lys Cys Pro Ser Gly Asp Ser 
275 280 285 

Ala Gly Lys Phe Lys Arg Pro He lie Ala Gly Met Glu Phe Ser Arq 
290 295 300 

40 Ser L y s s er Asp Asn Ser Ser His Pro Gin Lys Asp Glu Gly Asp Asn 

3 °5 310 315 320 

Pro Glu Thr lie Met Ser Ser Gly Asn Val Asn Ser Ser Ser 
325 330 

(2) INFORMATION FOR SEQ ID NO : 74 : 
4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(11) MOLECULE TYPE: peptide 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

He Thr Tyr Tyr He Leu He Gly Leu Cys Ala Val Val Gly Asn He 

15 10 15 

Leu Leu Val He Trp Val Val Lys Leu Asn Arg Thr Leu Arg Thr Thr 

20 25 30 

10 Thr Phe Tyr Phe He Val Ser He Ala Leu Ala Asp He Ala Val Leu 

35 40 45 

Val He Pro Leu Ala He Ala Ser Ala Trp Arg Ser Arg Cys Thr Ser 

50 55 60 

Asn Cys Leu Phe Met Ser Cys Val Leu Leu Val Phe Thr His Ala Ser 
15 65 70 75 80 

He Met Ser Leu Leu Ala He Ala Val Asp Arg Tyr Leu Arg Val Lys 
ftS 90 95 

Leu Thr Val Arg Tyr Arg Thr Val Thr Thr Gin Arg Arg He Trp Leu 
100 105 110 

2 0 Phe Leu Gly Leu Cys Trp Leu Val Ser Phe Leu Val Gly Leu Thr Pro 

115 120 125 

Trp Gly Trp Asn Arg Lys Val Thr Leu Glu Leu Ser Gin Asn Ser Ser 
130 135 140 

Thr Leu Arg Glu Phe Lys Thr Pro Lys Ser Leu Phe Leu Val Leu Phe 
25 145 150 155 160 

Leu Phe Ala Leu Cys Trp Leu Pro Leu Ser He He Asn Phe Val Ser 
165 170 175 

Tyr Phe Asn Val Lys He Pro Glu Thr Leu Leu Gly He Leu Leu Ser 
180 185 190 

3 0 His Ala Asn Ser Leu Pro He Val Tyr Ala Cys Lys Lys Lys Phe Lys 

195 200 205 

Glu Thr Tyr Phe Val He Leu Arg Ala Cys Arg Leu Cys Gin Thr Ser 

210 215 220 

Asp Ser Leu Asp Ser Asn Leu Glu Gin Thr Thr Glu 
35 225 230 235 

(2) INFORMATION FOR SEQ ID NO: 75: 
<l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: ammo acid 

4 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

■ X .1 li> '\J lT ii! v „ H LJ i : j ^ hC _L ir' 1 iwN ' IdIL^' j. iVw " • T- 



Thr Al a Th r Asn He Tyr He Leu Asn He Al a He Al a Asp Glu Le u 
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35 40 45 

Leu Val Pro Phe Leu Val Thr Ser Thr Leu Leu Arg His Trp Pro Phe 
50 55 60 

Gly Ala Leu Leu Cys Arg Leu Val Leu Ser Val Asp Ala Val Asn Met 

65 70 75 80 

Phe Thr Ser lie Tyr Cys Leu Thr Val Leu Ser Val Asp Arg Tyr Val 
85 90 95 

Ala Val Val His Pro lie Lys Ala Ala Arg Tyr Arg Arg Pro Thr Val 
100 105 110 

Ala Lys Val Val Asn Leu Gly Val Trp Val Leu Ser Leu Leu Val lie 

115 120 125 

Leu Pro lie Trp Phe Ser Arg Thr Ala Ala Asn Ser Asp Gly Thr Val 
130 135 140 

Ala Cys Asn Met lie Trp Glu Pro Ala Gin Phe Trp Leu Vai Gly Phe 
15 14 5 150 155 160 

Val Leu Tyr Thr Phe Leu Met Phe Leu Leu Pro Val Gly Ala lie Cys 
165 170 175 



10 



20 



Leu Cys Tyr Val Leu lie lie Ala Lys Met Arg Met Val Ala Leu Lys 
180 185 190 

Ala Gly Trp Gin Gin Arg Lys Arg Ser Glu Arg Lys lie Thr Leu Val 
195 200 205 

Met Met Val Val Met Val Phe Val lie Cys Trp Phe Tyr Val Val Gin 
210 215 220 

L eu Val Asn Val Phe Ala Glu Gin Asp Asp Ala Thr Val Ser Gin Leu 

25 2 25 230 235 240 

Ser Val lie Leu Gly Tyr Ala Asn Ser Cys Ala Asn Pro lie Leu Tyr 
245 250 255 



30 



Gly Phe Leu Ser Asp Asn Phe Lys Arg Ser Phe Gin Arg lie Leu Cys 
260 265 270- 

Leu Ser Leu Asn Ala Ala Glu Glu Pro Val Asp Tyr Tyr Ala Thr Ala 
275 280 285 

Leu Lys Ser Arg Ala Tyr Ser Val Glu Asp Phe Gin Pro Gla Asn Leu 
290 295 300 

Glu Ser Gly Gly Val Phe Arg Asn Cys Thr Cys Ala Ser Arg lie Ser 
35 305 310 315 320 

Thr Leu 

(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 298 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

45 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Val Thr Asn Tyr lie Phe Leu Leu Leu Cys Leu Cys Gly Leu Val Gly 

15 10 15 
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Asn Gly Leu Val Leu Trp Phe Phe Gly Phe Ser lie Lys Arg Thr Pro 

20 25 30 

Phe Ser He Tyr He Tyr Phe Leu His He Ala Ser Ala Asp Gly He 
35 40 45 

5 Tyr Leu Phe Ser Lys Ala Val He Ala Leu Leu Asn Met Gly Thr Phe 

50 55 60 

Leu Gly Ser Phe Pro Asp Tyr Val Arg Arg Val Ser Arg He Val Gly 

65 70 75 80 

Leu Thr Phe Phe Ala Gly Val Ser Leu Leu Pro Ala He Ser He Glu 
10 85 90 95 

Arg Cys Val Ser Val He Phe Pro Met Trp Tyr Trp Arg Arg Arg Pro 
100 105 110 

Lys Arg Leu Ser Ala Gly Val Cys Ala Leu Leu Trp Leu Leu Ser Phe 
115 120 125 

15 Leu Val Thr Ser He His Asn Tyr Phe Cys Leu Leu Gly His Glu Ala 

130 135 140 

Ser Gly Thr Ala Cys Leu Asn Met Asp He Ser Leu Leu Gly He Leu 
145 150 155 160 

Leu Phe Phe Leu Phe Cys Pro He Met Val Leu Pro Cys He Ala Leu 
20 165 170 175 

Leu His Val Glu Cys Arg Ala Arg Arg Arg Gin Arg Ser Ale Lys Leu 
180 185 190 

Asn His Val Val Leu Ala He Val Ser Val Phe Leu Val Ser Ser He 
195 200 205 

2 5 Tyr Leu Gly He Asp Trp Phe Leu Phe Trp Val Phe Gin He Pro Ala 

210 215 220 

Pro Phe Pro Glu Tyr Val Arg Asp Leu Cys He Cys He Asn Ser Ser 
225 230 235 240 

Ala Lys Pro He Val Tyr Phe He Ala Gly Arg Asp Lys Ser Gin Arg 
30 245 250 255 

Leu Trp Glu Pro Leu Arg Val Val Phe Gin Arg Ala Leu Arg Asp Gly 

260 265 270 

Ala Glu Pro Gly Asp Ala Ala Ser Ser Thr Pro Asn Thr Val Thr Met 

275 280 285 

3 5 Glu Met Gin Cys Pro Ser Gly Asn Ala Ser 

290 295 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS ■ single 



Ala He Val Leu He Thr Gin Leu Leu Thr Asn Arc Val 



Leu Gly Tyr 
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20 25 30 

Ser Thr Pro Thr lie Tyr Met Arg Asn Leu Tyr Ser Thr Asn Phe Leu 
35 40 45 

Thr Leu Thr Val Leu Pro Phe lie Val Leu Ser Asn Gin Trp Leu Leu 

5 50 55 60 

Pro Ala Cys Tyr Val Ala Ser Cys Lys Phe Leu Ser Val lie Tyr Tyr 
65 70 75 80 

Ser Ser Cys Thr Val Gly Phe Ala Thr Val Ala Leu He Ala Ala Asp 
85 90 95 

10 Arg Tyr Arg Val Leu His Lys Arg Thr Tyr Ala Arg Gin Ser Tyr Arg 

100 105 110 

Ser Leu Leu Leu Thr Trp Leu Ala Gly Leu He Phe Ser Val Pro Ala 
115 120 125 

Ala Val Tyr Thr Thr Val Val Met His His Asp Ala Asn Asp Thr Asn 
15 130 135 140 

Asn Thr Asn Gly His Ala Thr Cys Val Leu Tyr Phe Val Ala Glu Glu 
145 150 155 160 

Val His Thr Val Leu Leu Ser Trp Lys Val Leu Leu Thr Met Val Trp 
165 170 175 

2 0 Gly Ala Ala Pro Val He Leu Phe Tyr Ala Phe Phe Tyr Se- Thr Val 

180 185 190 

Gin Arg Thr Ser Gin Lys Gin Arg Ser Arg Thr Leu Thr Phe Val Ser 
195 200 205 

Val Leu Leu He Ser Phe Val Ala Leu Gin Thr Pro Tyr Val Ser Leu 
25 210 215 220 

Met He Phe Asn Ser Tyr Ala Thr Thr Ala Trp Pro Met Cys Glu His 
225 230 235 240 

Leu Thr Leu Arg Arg Thr He Gly Thr Leu Ala Arg Val Val Pro His 
245 250 255 

3 0 Leu His Cys Leu He Asn Pro He Leu Tyr Ala Leu Leu Cys His Asp 

260 265 270 

Phe Leu Gin Arg Met Arg Gin Cys Phe Arg Gly Gin Leu He Asp Arg 
275 280 285 

Ala Phe Leu Arg Ser Gin Gin Asn Gin Arg Ala 
35 290 295 

(2) INFORMATION FOR SEQ ID NO: 78: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS : single 

( D ) TO POLOGY : linear 
<ii) MOLECULE TYPE: peptide 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO : 7 8 : 

Leu Gly Val Trp Leu Met He Val Gly Thr Phe Leu Leu Val He Thr 
45 l 5 10 15 

Thr He Leu Tyr Tyr Arg Arg Lys Lys Lys Ser Pro Ser Asd Thr Tyr 

2 0 25 30* 
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lie Cys Asn Leu Ala Val Ala Asp Leu Leu lie Val Val Gly Leu Pro 
35 40 45 

Phe Phe Leu Glu Tyr Ala Lys His His Pro Lys Leu Ser Arg Glu Val 
50 55 60 

5 Val Cvs Ser Gly Leu Asn Ala Cys Phe Tyr lie Cys Leu Phe Ala Gly 

65 " 70 75 80 

Val Cys Phe Leu lie Asn Leu Ser Met Asp Arg Tyr Cys Val lie Val 
85 90 95 

Trp Gly Val Glu Leu Asn Arg Val Arg Asn Asn Lys Arg Ala Thr Cys 
10 100 105 110 

Trp Val Val lie Phe Trp lie lie Ala Val Leu Met Gly Met Pro His 
115 120 125 

Tyr lie Met Tyr Ser His Thr Asn Asn Glu Cys Val Gly Trp Phe Ala 
130 135 140 

15 Asn Glu Thr Ser Cys Trp Phe Pro Val Phe Leu Asn Thr Ly.- Val Asn 

145 150 155 160 

lie Cys Gly Tyr Leu Ala Pro lie Ala Leu Met Ala Tyr Tyr Asn Arg 
165 170 175 

Met Val Arg Phe lie lie Asn Tyr Val Gly Lys Trp Phe Met Gin Thr 
20 180 185 190 

Leu His Val Leu Leu Val Val Val Val Ser Phe Ala Ser Phe Trp Phe 
195 200 205 

Pro Phe Asn Leu Ala Leu Phe Leu Glu Ser lie Arg Leu lie Ala Gly 
210 215 220 

25 Val Tyr Asn Asp Thr Leu Gin Asn Val He He Phe Cys Leu Tyr Val 

225 230 235 240 

Gly Gin Phe He Ala Tyr Val Arg Ala Cys Leu Asn Pro Gly He Tyr 
245 250 255 

He Leu Val Cys Thr Trp Phe Leu Arg Val Phe Ala Cys Cys Cys Val 
30 260 265 270 

Lys Gin Glu He Pro Tyr Gin Asp He Asp He 
275 280 

(2) INFORMATION FOR SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
{ii} MOLECULE TYPE: peptide 

4 0 (xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Pro Val Thr Leu Phe Leu Tyr Gly Val Val Phe Leu Phe Gly Ser He 



Phe Val Cys Thr Leu Pro Leu Trp Met Gin Tyr Leu Leu Asc His Asn 
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50 55 60 

Ser Leu Ala Ser Leu lie Pro Cys Thr Leu Leu Thr Ala Cys Phe Tyr 
65 70 75 80 

Val Ala lie Thr Ala Ser Leu Cys Phe lie Thr Glu lie Ala Leu lie 
5 85 90 95 

Asp Arg Tyr Tyr Ala lie Val Tyr Met Arg Tyr Arg Pro Val Lys lie 
100 105 110 

Gin Ala Cys Leu Phe Ser lie Phe Trp Trp lie Phe Ala Val lie lie 
115 120 125 

10 Ala lie Pro His Phe Met Val Val lie Thr Lys Lys Asp Asn Gin Cys 

130 135 140 

Met Thr Asp Tyr Asp Tyr Leu Glu Val Ser Tyr Pro lie lie Leu Asn 
145 150 155 160 

Val Glu Leu Met Leu Gly Ala Phe Val lie Pro Leu Ser Val lie Ser 
15 165 170 175 

Tyr Cys Tyr Tyr Arg lie Ser Arg lie Val Ala Val Ser Gin Ser Arg 
180 185 190 

His Lys Gly Arg lie Val Arg Val Leu He Ala Trp Leu Val Phe He 
195 200 205 

2 0 He Phe Trp Leu Pro Tyr His Leu Thr Leu Phe Val Asp Thr He lie 

210 215 220 

Lys Leu Leu Lys Trp He Ser Ser Ser Cys Glu Phe Glu Arg Ser Leu 
225 230 235 240 

Lys Arg Ala Leu He Leu Thr Glu Ser Leu Ala Phe Cys His Cys Cys 
25 245 250 255 

Leu Asn Pro Leu Leu Tyr Val Phe Val He Gly Thr Lys Phe Arg Lys 
260 265 270 

Asn Tyr Thr Val Cys Trp Pro Ser Phe Ala Ser Asp Ser Phe Pro Ala 
275 280 285 

3 0 Met Tyr Pro Gly Thr Arg Ala 

290 295 

(2) INFORMATION FOR SEQ ID NO: 80: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

4 0 Asp Asp Asp Asp Asn He Trp Ser He Phe Asp Trp He Gly Tyr Leu 

15 10 15 

Asn Ser He Ser Met Val He Tyr Thr Leu Phe Lys Lys Lys Lys 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 81: 
4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: linear 
di) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Asp Asp Asp Asp Asn lie Trp Asn lie Phe Ser Thr lie Gly Tyr Leu 
5 15 10 15 

Asn Ser lie Ser Pro Val Ser Val lie Met His lie Tyr Gly Lys Lys 
20 25 30 

Lys Lys 



10 (2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 ( D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Asp Asp Asp Asp Gly Tyr Ser lie Tyr Asp Thr Leu Val Thr Phe Ala 

15 10 15 

2 0 lie Asn Pro Val Tyr lie Thr Val Phe Lys Lys Lys Lys 

20 25 

(2) INFORMATION FOR SEQ ID NO:83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

3 0 Asp Asp Asp Asp Asn Ala Trp Ser Ala Phe Asp Trp Ala Leu Tyr Leu 

15 10 15 

Asn Ser lie Ser Met Ala lie Tyr Thr Tyr Ala Lys Lys Lys Lys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO : 84 : 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

4 0 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Leu Phe Ser Phe lie Thr Trp Leu Gly Tyr Ala Asn Ser Ser Leu Asn 
15 10 15 

Pro lie lie Tyr Thr Thr Phe 
45 20 

(2) INFORMATION FOR SEQ ID NO: 85: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

5 0 (C) STRANDEDNESS: single 



lie Met Val Zle Thr Tvr Thr 

20 
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(2) INFORMATION FOR SEQ ID NO: 86: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(li) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

lie Trp Leu Thr Ser Asp lie Met Ser Thr Ser Ser lie Leu His Asn 
10 1 5 10 15 

Leu Cys Val lie Ser Phe 
20 

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

lie Trp Ser lie Phe Ser Ser Asp lie Val Val Gly Tyr Ala Asn His 
15 10 15 

Ser Ser Leu Ala lie Met Cys Pro lie Val lie Tyr Thr Va- 
20 25 30 

2 5 (2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

lie Phe Thr lie Phe Ser Ser Asp lie Ala Val Gly Tyr Ala Asn His 

15 10 15 

3 5 Ser Ser Ala Ala lie Met Pro lie Val lie Tyr Ser Val 

20 25 

(2) INFORMATION FOR SEQ ID NO: 89: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

4 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
45 Lys Asn Ala Ser Ala Leu Leu Ser Val lie lie lie Asn Ser lie Gly 

15 10 15 

Gly Asn Val Val Thr Ala Val Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 

5 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
5 5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Tyr Phe Leu Met Ser Leu Ala Val Thr Asp Leu Val Val Ser Phe Val 

1 S 10 15 
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Met Pro Val Ser Ala Leu 

20 

(2) INFORMATION FOR SEQ ID NO : 9 1 : 
(l) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 3 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

Ala lie Thr Lys lie Ala lie Thr Trp Ala lie Ser Gly Val Ser Val 
15 10 15 

Pro Phe lie Pro Val Trp Gly 
20 

15 (2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
2 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

Leu Gly lie lie Phe Gly Thr Phe lie He He Trp Leu Pr- Phe Phe 

15 10 15 

2 5 He Thr Asn Leu Val Ser Pro He 

20 

(2) INFORMATION FOR SEQ ID NO: 93: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

3 0 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

3 5 He Trp He Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ser He Met 

15 10 15 

His Leu Cys Ala He Ser Leu 
20 

(2) INFORMATION FOR SEQ ID NO:94: 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

4 5 (ii) MOLECULE TYPE: peptide 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Gly Tyr Thr He Tyr Ser Thr Leu Val Thr Phe Tyr He Pro Ser Val 

1 - 5 10 15 

He Met Val He Thr Tyr Gly 

5 0 2 0 



(D; TOPOLOGY : linear 
.ii, MOLECULE TYPE: peptide 

txi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 5 : 
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Leu Leu Asn Phe Phe Asn Trp lie Gly Tyr Leu Asn Ser Leu lie Asn 
15 10 15 

Pro Val lie Tyr Thr Leu Phe 

20 
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WHAT IS CLAIMED IS: 

1. A G-protein coupled receptor polypeptide, consisting 
essentially of an amino acid sequence of 15 to 4 0 amino acids 
substantially corresponding to a fragment or consensus peptide of a 

5 transmembrane domain of a G-protein coupled receptor, wherein said 
polypeptide has a GPR- related biological activity selected from 
binding a GPR ligand or modulating GPR ligand binding to a GPR. 

2. A polypeptide according to claim 1, wherein said 
polypeptide is selected from a synthetic polypeptide, a recombinant 

10 polypeptide or a purified polypeptide. 

3. A polypeptide according to claim 1, wherein said G- 
protein coupled receptor is a receptor selected from a cAMP receptor, 
an adenosine receptor, a .^-adrenergic receptor, a muscarinic 
acetylcholine receptor, an a- adrenergic receptor, a serotonin 

15 receptor, a histamine H2 receptor, a thrombin receptor, a kinin 
receptor, a follicle stimulating hormone receptor, an opsin, a 
rhodopsin, an odorant receptor, a cytomegalovirus receptor, or a mas 
oncogene GPR . 

4 . A polypeptide according to claim 1 , wherein said 
2 0 transmembrane domain is selected from at least one of transmembrane 

domain TMl , TM2 , TM3 , TM4 , TM5 , TM6 or TM7 . 

5 . A polypeptide according to claim 3 , wherein said 
transmembrane domain is a D, receptor transmembrane segment III or 
segment V. 

25 6. A polypeptide according to claim 4, wherein said 

polypeptide has the amino acid sequence of Fig. 2 (SEQ ID NO: 2 ) . 

7. A polypeptide according to claim 4, wherein said 
polypeptide has the amino acid sequence of Fig. 3 (SEQ ID NO : 3 ) . 

8 . A polypeptide according to claim 4 , wherein said 
30 polypeptide has an amino acid sequence selected from one of SEQ ID 

NOS : 80-95 . 

9. A polypeptide according to claim 4, wherein said 
polypeptide has an amino acid sequence of one of SEQ ID NOS: 96 -348. 



WO 94/05695 




PCT/US93/08528 



- 140 - 

11. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS-.226- 
289 . 

12. A polypeptide according to claim 9, wherein said 
5 polypeptide has an amino acid sequence from one of SEQ ID NOS:290- 

297 . 

13. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:298- 
324 . 

10 14. A polypeptide according to claim 9, wherein said 

polypeptide has an amino acid sequence from one of SEQ ID NOS:325- 
338 . 

15. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:339- 

15 348. 

16. A polypeptide according to claim 3, wherein said 
transmembrane domain is a dopamine receptor transmembrane domain 
selected from the group consisting of a D 1# D 2i D 3 , D 4 or D 5 
transmembrane domain . 

20 17. A composition comprising a polypeptide according to 

claim 1, or a pharmaceut ically acceptable ester, ether, sulfate, 
carbonate, glucuronide or salt thereof, and a pharmaceut ically 
acceptable carrier . 

18. A composition according to claim 16, wherein said 
25 transmembrane domain is D 2 receptor transmembrane segment III or 

segment V. 

19. A composition according to claim 18, further 
comprising a drug selected from a phenothiazine derivative, a 
thioxanthine derivative, a butyrophenone derivative, a 

3 0 dihydroindolone , a dibenzoxazepine derivative and an atypical 
neuroleptic . 

20. A method for treating a subject suffering from a 
pathology related to an abnormality of a G-protein coupled receptor, 
comprising administering to said subject a therapeutically effective 

35 amount of composition according to claim 16. 

21. The method of claim 20, wherein said pathology is a 
psychotic disorder . 
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2 2 . The method of claim 21 , wherein said psychotic 
disorder is a schizophrenia. 

23 . The method of claim 20 , wherein said composition is 
administered to provide said polypeptide, fragment or consensus 

5 peptide thereof, in an amount ranging from about 0.01 fig to 100 mg/kg 
per day . 

24 . The method of claim 23 , wherein said composition is 
administered to provide said polypeptide, fragment or consensus 
peptide thereof, in an amount ranging from about 10 fig to 10 mg/kg per 

1 0 day . 

25. The method of claim 20, wherein said administering is 
by oral, mucosal, intravenous, intramuscular or parenteral 
administration . 

26. A method for producing a polypeptide according to 
15 claim 1 , wherein said polypeptide is a recombinant polypeptide 

obtained from a recombinant host which expresses a heterologous 
nucleic acid encoding said polypeptide, comprising the steps of: 

(A) providing a host comprising a recombinant nucleic 
acid encoding a polypeptide according to claim 1 in 

2 0 expressible form; 

(B) culturing said host under conditions such that 
said polypeptide is expressed in recoverable amounts; and 

(C) recovering said polypeptide produced by said 

host . 

25 27. The method of claim 26, further comprising: 

(D) purifying said polypeptide. 

28. The method of claim 26, wherein said host is a 
bacteria or a eukaryotic cell. 

29. The method of claim 28, wherein said eukaryotic cell 

3 0 is a mammalian cell , an insect cell or a yeast cell . 

30. A method for producing a polypeptide according to 
claim 1 , comprising : 

(A) chemically synthesizing a polypeptide according 
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31. A method for isolating a G-protein coupled receptor, 
fragment or consensus sequence thereof, or a protein that binds the 
G-protein coupled receptor, comprising 

(A) providing a bound support, said support being 
5 bound to a polypeptide according to claim 1, or an 

antibody, ant i - idiotype antibody, or a fragment thereof; 

(B) contacting a sample containing said G-protein 
coupled receptor or said protein that binds a G-protein 
coupled receptor to said bound support, such that said 

10 receptor or protein is reversibly bound to said bound 

support ; and 

(C) recovering said receptor or protein that is 
attached to the bound support by dissociating the receptor 
or protein under conditions that cause elution or 

15 dissociation of the receptor or protein from said bound 

support . 

32. A method according to claim 31, wherein said GPR is 
a dopamine receptor . 

33. An antibody, anti - idiotype antibody or a fragment of 
20 said antibody or anti - idiotype antibody, that specifically displays 

an epitope of a G-protein coupled receptor polypeptide, according to 
claim 1. 

34 . A recombinant nucleic acid comprising a nucleotide 
sequence encoding a G-protein coupled receptor polypeptide according 

25 to claim 1. 

35. A vector comprising a nucleic acid according to claim 

34 . 

36. A host cell comprising the nucleic acid of claim 34. 

37. A host cell according to claim 36, wherein said host 
30 cell is selected from a mammalian cell, a yeast cell, a bird cell or 

an insect cell. 

38. A host cell according to claim 36, wherein, when said 
nucleic acid is expressed as said receptor polypeptide in said host 
cell, a receptor binding molecule comprising said env binding domain 

35 binds to said receptor polypeptide. 
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39. A host cell according to claim 37, wherein said host 
cell is a mammalian cell selected from a human cell, a primate cell 
or a rodent cell. 

40. A method for isolating a protein that binds a 
G-protein coupled receptor, comprising 

(A) providing a bound support, said support being 
bound to a polypeptide according to claim 1, or 
anti - idiotype antibody thereto; 

(B) contacting a sample containing said protein that 
binds a G-protein coupled receptor to said bound support, 
such that said protein is reversibly bound to said bound 
support ; and 

(C) recovering said protein that is attached to the 
bound support by dissociating the receptor or protein under 
conditions that cause elution or dissociation of the 
protein from said bound support. 

41. A method according to claim 40, wherein said GPR is 
a dopamine receptor. 
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